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In  addition  to  expressing  our  appreciation  of  the  sponsorship 
provided  by  the  Cooperating  Societies,  the  organisers  of  OC'94 
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The  Proceedings  of  the  Conference  will  be  published  by 
the  UK  Institute  of  Physics  Publishing. 
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Monday  22  August,  8.46  - 10.30  a.m. 

8.46  sjh. 

INTRODUCTION 

B.  S.  Wherrett  Hanoi- Watt  Univ  UK 

S.  D.  Smith  Heriot-Watt  Univ  UK 

C.  Dainty  Imperial  College  . 


Planary 

(Session  Chain  S.  D.  South,  Heriot-Watt  Univ,  UK) 

9.00  a.m. 

MAI  INVITED  -  Quantum  Wall  Smart  Pixels  for  Optical  Switching  and  Processing,  DAB. 
Miller.  AT&T  Bell  Labs,  U.SA.  A  broad  variety  of  experimental  smart  circuits  with  quantum  well 
optical  inputs  and  outputs  has  been  fabricated  for  many  users.  Current  status  and  future 
prospects  for  such  large  scale  optoelectronic  integration  is  summarized. 

9  JO  a.m. 

MA2  INVITED  -  Largs  Scale  Integration  of  LEDs  and  Gsas  circuits  Fabricated  Through 
MOSIS,  A.C.  Grot,  D.  Psaltis,  K.V.  Shenoy*  and  C.G.  Constad,  ,*  California  Ins*,  of  Tech., 
U.S.A.  (*MIT,  Cambridge,  U.S.A.).  We  describe  a  new  process  to  integrate  LEDs  with  fully 
processed  GaAs  circuits  using  MBE  regrowth.  The  bottom  contact  of  tne  LED,  grown  in  the 
dielectric  vias,  is  made  through  the  n+source/drain  implant. 

9.45  a.m. 

MA3  INVITED  -  Parallel  Algorithms  for  Optical  Processors,  P.  Chavel,  Inst.  d'Cptique,  Paris, 
France.  Fine  grain  parallel  optoelectronic  processors  for  dedicated  tasks  may  deserve  further 
consideration.  The  main  case  is  vision  machines;  stochastic  algorithms  working  on  images 
considered  as  Markov  fields  are  one  possible  approach  to  tackle  real  images. 

10.15  a.m. 

MA4  INVITED  -  Quantum-Statistical  Restrictions  on  the  Information  Transmitting/ 
Processing  Rate  in  Electronic  and  Photonic  Charmsla,  F.V.  Karpushko,  DOPIT,  Academy  of 
Sciences  of  Beianjs,  Minsk,  Belarus.  Quantum  effects  are  shown  to  limit  the  information 
capacity  of  electronic  channels  by  -  1012bit/s.  At  higher  rates  the  energy  price  for  transmitting 
information  is  2-4  order  less  in  ID-boson  channels  as  compared  with  1  D-fermion  ones.  The  3D- 
optical  channels  are  free  of  quantum  restrictions  up  to  information  rates  -  lO^nit/cnr^s. 

10 JO- 11.00 a.m.  Coffee  Break 


Monday  22  August,  11.00  - 12.30  pjn. 

Plenary 

(Session  Chair.  J.Lde  Bougrenet,  ENST  de  Bretagne,  France) 

11.00  ajn. 

MSI  INVITED  -  SpatisHJght  Modulator  Based  Routing  Systems,  W.A.  Crossland,  R.J. 
Meats  and  S.T.  Warr,  Univ.  Cambridge,  U.K.  This  paper  reviews  the  principles  of  SLM  based 
free  space  photonic  switch  structures  based  on  shadow  logic  and  on  dynamic  holograms 
configured  for  optical  fibre  networks  for  parallel  optoelectronic  machines  and  highly  parallel  chip- 
to-chip  interconnections. 


IV 


11.30  a.m. 

MB2  INVITED  -  Independently  Addressable  Vertical-Cavity  Surface-Emitting  Laser  Diode 

Arrays,  E.  Zeeb,  B.  Mfiller,  T.  Hackbarth*,  H.  Leier*  and  KJ.  Ebeling,  University  of  Ulm, 
Germany  (' Daimler  Benz  Research  Center,  Ulm,  Germany).  The  fabrication  of  very  uniform, 
planar  10  x  10  vertical-cavity  surface-emitting  laser  diode  arrays  is  reported.  Individual  elements 
show  threshold  currents  around  4  mA,  emit  up  to  300  pW  single  mode,  and  exhibit  a  3  dB 
modulation  bandwidth  of  4.6  GHz. 


11.45  a.m. 

MB3  INVITED  -  The  Scottish  Collaborative  Initiative  on  Optoelectronic  Sciences  -  Devices 
and  Demonstrators  for  Free-Space  Digital  Optical  Processing,  A.C.  Walker,  D.J.  Goodwill, 
B.S.  Ryvkin  M.  McElhinney,*  F.  Pottier*.  B.  Vogele*,  M.C.  Holland*  and  C.R.  Stanley*,  Dept. 
Phys.,  Heriot-Watt  Univ.,  Edinburgh,  U.K.  ('Dept.  E&E  Eng.,  Univ.  Glasgow,  U.K.).  16  x  16 
arrays  of  strained-InGaAs/GaAs  S-SEEDs  have  been  successfully  operated  with  a  diode- 
pumped  1064  nm  laser.  These,  and  other  GaAs/GaAIAs  devices,  are  being  incorporated  into 
digital  parallel  optical  processing  demonstrators.  The  status  of  this  programme  will  be  reviewed. 

12.15  p.m. 

MB4  INVITED  -  Smart  Pixel  Arrays:  Challenges  of  Manufacturing  and  Testing,  G.  Livescu, 
L.M.F.  Chirovsky,  L.A.  D  Asaro,  S.  Hui,  R.A.  Novotny*,  A.L.  Lentine*,  M.W.  Focht**,  J.M. 
Freund**,  K.G.  Glogovsky**,  G.D.  Guth**,  G.  Przybylek**  and  R.E.  Leibenguth**,  AT&T  Bell 
Labs.,  Murray  Hill,  U.S.A.  ('AT&T  Bell  Labs.,  Naperville,  U.S.A.,  "AT&T  Bell  Labs.,  Breinigsville, 
U.S.A.).  Wafer  level  optical  and  electronic  testing  of  smart  pixel  arrays  is  a  must  if  a  high  yield 
technology  is  to  be  developed.  Our  results  on  GaAs  FET-SEED  based  switching  nodes  show 
the  uniformity  and  performance  control  levels  achievable  with  today's  technology,  as  well  as  its 
limits. 


12.30  -  2.00  p.m.  Lunch  Break 


Monday  22  August,  2.00  -  3.45  p.m. 

Smart  Pixels  and  SLMs 

(Session  Chain  J.  Hegarty,  Trinity  College,  Dublin.  Ireland ) 


2.00  p.m. 

MCI  Monollthlcally  Integrated  Optoelectronic  Smart  Pixels,  U.  Kehrii,  D.  Leipoid,  K.  Thelen, 
H.P.  Schweizer,  P.  Seitz  and  B.D.  Patterson,  Paul  Shorter  Institut,  Zurich,  Switzerland.  We 
describe  a  monolithicalty  integrated  optoelectronic  smart  pixel  fabrication  process  using  a 
GaAs/AIGaAs  system.  The  smart  pixels  are  realised  with  depletion  type  metal  semiconductor 
field-effect  transistors  (MESFETs),  high  brightness  light-emitting  diodes  (HBLEDs)  and 
photodiodes  (PDs). 

2.15  p.m. 

MCI  Optical  Algorithmic  and  Electronic  Considerations  on  the  Desirable  "Smartness”  of 
Optical  Processing  Pixels,  M.P.Y  Desmulliez,  J.F.  Snowdon,  J.A.B.  Dines  and  B.S.  Wherrett, 
Dept.  Physics,  Heriot-Watt  Univ.,  Edinburgh,  U.K.  The  performance  gains  associated  with 
optical  computing  schemes  must  be  traded  off  with  the  complexity  of  individual  computing  nodes 
in  order  to  optimise  the  efficiency  of  an  optical  processor.  We  categorise  the  necessary  trade¬ 
offs  and  provide  examples  of  such  optimisation. 


V 


2.30  p.m. 

MC3  Low-Loss  and  High-Speed  Optical  Switching  Modulaa  for  1.3  pm  Wavelength  Ueing 
Active  Matrix  Ferroelectric  Liquid  Crystal  Devices,  S.  Shirai,  T.  Serikawa,  S.  Kohda,  N. 
Kakuda,  M.  Okamura  and  N.  Yamauchi,  NTT  Interdisciplinary  Research  Labs.,  Tokyo,  Japan.  A 
highly  efficient  optical  coupling  system  using  microlens  arrays  tor  free* space  optical  switching 
networks  was  proposed  and  demonstrated.  High  speed  (<  100  ps)  and  high  extinction  ratio  for 
1.3  pm  wavelength  were  obtained  by  a  newly  proposed  ferroelectric  liquid-crystal  driving 
method. 

2.45  p.m. 

MC4  Dynamic  Free  Space  Optical  Interconnections  Using  Liquid  Crystal  Micro  Optical 

Beam  Deflectors,  K.  Hirabayashi,  T.  Yamamoto  and  M.  Yamaguchi,  NTT  Communication 
Switching  Labs.,  Tokyo,  Japan.  Micro-optical  beam  deflectors  with  a  high  transmittance  of  95%, 
a  large  deflection  angle  of  15°,  and  with  a  structure  of  liquid  crystals  sandwiched  by  microprism 
plates  are  used  in  tree-space  optical  interconnections  and  optical  switches. 

3.00  p.m. 

MC5  Liquid  Crystal  Over  Silicon  Spatial  Light  Modulators,  DJ.  McKnight,  K.M.  Johnson, 
G.D.  Sharp*  and  R.A.  Serati*,  Optoelectronic  Computing  Systems  Center,  Univ.  of  Colorado, 
U.S.A.  C Boulder  Nonlinear  Systems,  Colorado,  U.S.A).  This  paper  describes  our  recent  liquid 
crystal  on  silicon  spatial  light  modulators.  We  will  present  the  most  recent  results  from  our  256 
by  256  binary  reflection  mode  SLM  and  our  128  by  128  analog  SLM. 

3 .15  p.m. 

MC6  Electronically  Addressed  Ferroelectric  Liquid  Crystal  over  Silicon  Spatial  Light 
Modulators,  D.C.  Bums,  I.  Underwood,  A.  O'Hara*,  and  D.G.  Vass*,  Dept,  of  Elec.  Eng.,  ('Dept, 
of  Physics),  Univ.  of  Edinburgh,  U.K.  We  discuss  recent  advances  in  the  design  and  fabrication 
of  electronically-addressed  ferroelectric-liquid-crystal-over-silicon  spatial  light  modulators.  We 
summarise  the  prospects  for  further  advances  in  the  near  future. 


4.00  -  6.00  p.m.  Poster  Sessions 

Analogue  Processing  MP1  -  MP26 

Digital  Optics  MP27  -  MP53 

Guided-Wave  Devices  and  Processors  MP54  •  MP64 
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Tuesday  23  August,  9.00  - 10.30  a.m. 

Plenary 

( Session  Chair:  T.  Yatagai,  Univ.  of  Tsukuba,  Japan) 

9.00  a.m. 

TuAI  INVITED  -  Optical  Dataction  Filters  and  Algorithm  Fuaion,  D.  Casa  sent,  Carnegie 
Metton  University,  U.S.A..  Detection  of  candidate  object  regions  in  a  scene  is  step  one  in 
general  scene  analysis.  New  optical  morphological,  wavelet  and  Gabor  transform  filters  for 
detection  are  presented  and  fusion  is  used  to  reduce  false  alarms. 

9.30  a.m. 

TuA2  INVITED  •  Space-Variant  Filtering  in  Fractional  Fourier  Domains,  H.M.  Ozaktas,  B. 
Barshan,  D.  Mendlovic*  and  H.  Urey**,  Electrical  Engineering,  Bilkent  Univ.,  Ankara,  Turkey 
(* Faculty  of  Engineering,  Tel-Aviv  Univ.,  Israel,  *' Electrical  Engineering,  Georgia  Institute  of 
Technology,  Atlanta,  U.S.A.).  Signals  with  significant  overlap  in  both  the  space  and  frequency 
domains  may  have  little  or  no  overlap  in  a  fractional  Fourier  domain.  Spatial  filtering  in  these 
domains  may  allow  us  to  eliminate  distortion  components  which  cannot  be  eliminated  in  the 
ordinary  Fourier  domain. 

9.45  a.m. 

TuA3  INVITED  -  All-Optical  Dynamic  Memories,  M.P.  Petrov,  A.F.  Ioffe  Physical  Technical 
Institute  of  Russian  Academy  of  Sciences,  St.  Petersburg,  Russia.  State-of-the-art  review  of  all- 
optical  dynamic  memories  and  their  crucial  elements  is  given.  It  discusses  the  properties  of  all- 
optical  regenerators  of  short  optical  pulses.  Characteristics  and  properties  of  an  optical  dynamic 
memory  operating  through  SRS  in  optical  fibers  are  described. 

10.15  ajn. 

TuA4  INVITED  •  Parallel  Information  Recording  and  Processing  using  1-D  Hologram 
Technology,  A.L.  Mikaelian,  Russian  Academy  of  Sciences,  Russia.  1-D  hologram  technology 
for  parallel  information  recording  and  processing  and  some  applications  are  considered.  Among 
them  are  a  two-layer  neural  net  performing  serial-parallel  processing,  and  an  associative 
memory  using  photothermoplastic  material.  Both  the  theoretical  and  experimental  results  are 
presented. 

10.30  - 11.00  a.m.  Coffee  Break 


Tuesday  23  August,  11.00  - 12.30  p.m.  (Parallel  Session) 

Neural  Networks 

( Session  Chain  D.  Psaftis,  California  Inst,  of  Tech.,  USA) 

11.00  am. 

TuBI  Associative  Memory  for  Rotation  Pre-Processing  and  Projection  Invariant  Pattern 
Recognition,  G.  Lebreton,  GESSY,  University  de  Toulon,  France.  New  advances  on  invariant 
pattern  recognition  using  associative  optical  memories,  showing  simulations  and  first 
experimental  results,  on  1)  orientation  detection  of  any  object  before  identification,  2)  pattern 
recognition  invariant  to  scale  and  projections. 

11.15  a.m. 

TuB2  Optical  Implementations  of  a  Stochastic  Neural  System,  W.A.  Crossland,  T.J.  Hall*, 
J.S.  Shawe-Tayior**  and  M.  van  Daalen**,  Dept,  of  Eng.,  Cambridge  University,  U.K.  ('Dept,  of 
EAE  Eng.,  King's  College,  Univ.  London,  U.K.,  “Dept,  of  Comp.  Sci.,  Univ.  London,  U.K.).  The 
paper  addresses  the  design  of  an  optoelectronic  implementation  of  a  stochastic  bit-stream 
neural  system  which  operates  by  manipulating  digital  bit-streams  to  create  emergent  activation 
functions  using  extremely  simple  logic. 
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11 .30  am. 

TuB3  Dynamic  Effects  in  Vo fume  intarconnccts  with  Feedback,  C.  Stinger,  ER1  Division, 
DRA  MaJvem,  Worcs.,  U.K.  An  attempt  is  made  to  exploit  the  inherently  rich  physics  of  volume 
holographic  interconnects.  The  effects  of  crosstalk  and  nonlinearities  are  considered.  When 
combined  with  a  novel  feedback  configuration,  complex,  self-organising  behaviour  is  seen  to 
emerge. 

11.45  am. 

TuB4  Optical  Lateral  Inhibition  Networks  using  Sett-Linearised  SEED'S,  P.  Horan,  Hitachi 
Dublin  Lab.,  Trinity  College,  Dublin,  Ireland.  A  family  of  lateral  inhibition  architectures  which  use 
the  self -linearised  SEED  effect  to  implement  optical  subtraction  are  described,  and  the  operation 
demonstrated  in  simulation. 

12.00  noon 

TuB5  Optical  Spatiai/Fiequency  Filtering  and  Adaptive  Neural  Networks  Based  on 
Fractional  Fourier  Transforms,  S.-Y.  Lee,  Dept,  of  Elec.  Eng.,  KAIST,  Taejon,  Korea.  Based 
on  fractional  Fourier  transforms  a  new  optical  architecture  is  developed  to  make  compromise 
between  shift-invariant  (frequency)  and  position-dependent  filterings,  and  its  analogy  to  neural 
networks  and  corresponding  learning  algorithm  are  presented. 

12.15  p.m. 

TuB6  Experimental  Realisation  of  an  All-Optical  Self-Organising  Map,  K.  Hegarty,  J. 
Duvillier  and  J.L.  de  Bougrenet  de  la  Tocnaye,  ENST  de  Bretagne,  Brest,  France.  We  present 
results  obtained  with  an  all-optical  self-organising  map  neural  network  applied  to  digit 
recognition.  The  experimental  system  is  built  around  two  Ferroelectric  Liquid  Crystal  Bistable 
Optically  Addressed  Spatial  Light  Modulators  in  a  resonator  configuration. 


Tuesday  23  August,  11.00  - 12.30  p.m. 
(Parallel  Session  -  Lecture  Thsetre  4) 
Digital  Optic  Devices 

(Session  Chain  S.  Ishiham,  AIST,  Tokyo,  Japan ) 


11.00a.m. 

TuCI  Optoelectronic  Switch  Operating  with  0.2  fj/pm2  at  15  MHz,  M.  Kuijk,  P.  Heremans*, 
R.  Vounckx  and  G.  Borghs,  Vrije  Universiteit  Brussel,  Belgium  C IMEC,  Leuven,  Belgium).  Vie 
present  experimental  data  of  a  novel  differential  PnpN  optical  switch,  showing  a  cycle  time 
smaller  than  60  ns  and  an  optical  switching  energy  of  0.2  f  J/pm2. 

11.15  am. 

TuC2  AlAs/GaAs  Multilayered  Structures  as  Low  bitensity  Nonlinear  Optical  Media,  S. 

Knigge,  M.  Wicke  and  D.  Jfiger,  Fachgebiet  Optoetektronik,  Univ.  Duisburg,  Germany.  The 
nonlinear  optical  properties  of  hybrid  AlAs/GaAs  Bragg  reflectors  are  presented.  Experimentally, 
optical  bistability  at  optical  intensities  as  low  as  35  mWcrn"2  is  obtained  when  a  bias  voltage  of 
115  Vis  applied. 

11  JO  am. 

TuC3  Dynamical  Behaviour  of  Opto-Optical  Logic  Switching  Devices  Employing  n-l-p-i- 
bassd  Smart  Pixels,  M.  Kneissl,  P.  Kiesei,  P.  Riel,  K.  Reingruber,  K.H.  Gulden,  E.  Greger,  A. 
Hdfier,  B.  KnOpfer,  G.H.  DOhler,  X.X.  Wu*  and  J.S.  Smith*,  Inst,  fur  Technische  Physik,  Univ. 
Eriangen-Numberg,  Germany  (’Dept.  Elect.  Eng.  A  Comp.  Sd.,  Univ.  California,  Berkeley, 
U.S.A.).  We  report  on  experimental  results  on  the  dynamical  behaviour  of  n-i-p-i-based  smart 
pixels.  With  switching  energies  of  2.4  fj/pm2  contrast  ratios  of  4:1  at  1 .6  mW  output  power  were 
achieved.  The  opto-optical  gain  is  tunable  from  10-1 0®. 


Vlll 


11.30a.m. 

TuC4  An  Optical  Set-Reset  RipFlop  Semiconductor  Laaar  with  Two  Mutually 
Complementary  Outputa,  M.  Watanabe,  S.  Mukai  and  H.  Yajima,  Electrotechnical  Lab., 
Ibarakai,  Japan.  Theory  and  experiment  on  crosscoupied-mode  bistability  in  a  twin-stripe  laser 
is  reported.  The  laser  has  two  output  ports  complementary  to  each  other,  which  is  analogous  to 
a  set-reset  flipfiop  in  electronics. 

11.45  ajn. 

TuC5  Waveguide  Type  Rotating  Phase  Plate  as  Frequency  Shifter  on  (110)  GaAs 
Substrate,  H.  Inoue,  S.  Nishimura,  S.  Tanaka*  and  T.  Kanetake**,  RWCP  Optoelectronics 
Hitachi  Lab.,  Tokyo,  Japan  (* Central  Research  Lab.,  Hitachi  Ltd.,  Tokyo,  Japan,  "Advanced 
Research  Lab.,  Hitachi  Ltd.,  Saitama,  Japan).  The  fundamental  modulation  characteristics  of 
waveguide  type  frequency  shifter  based  on  the  rotating  phase  plate  are  demonstrated  on  (110) 
GaAs  substrate  as  an  optical  device  for  multi-dimensional  interconnection  for  the  first  time. 

12.00  noon 

TuC6  Asymmetric  Light  Bullet  Dragging  Logic,  R.  McLeod,  S.  Blair  and  K.  Wagner. 
Optoelectronic  Computing  Systems  Center,  Univ.  of  Colorado,  U.SA.  The  asymmetric  dragging 
interaction  between  three-dimensional  optical  solitons  may  allow  cascadable,  phase-insensitive, 
NOR  gates  with  gain  to  be  implemented  at  ultrahigh  speeds  in  massively  parallel  three- 
dimensional  bit-level  systolic-array  architectures. 

12.30  -  2.00  p.m.  Lunch  Break 


Wednesday  24  August,  9.00  - 10.30  a.m. 
Plenary 

(Session  Chair  B.  S.  Wherrett,  Heriot-Watt  Univ,  UK) 


9.00  a.m. 

WA1  INVITED  -  Architecture  Design  and  Implementation  Issues  for  Massively  Parallel 
Processors,  S.  Nelson,  Cray  Research  Inc.,  U.SA.  Dramatic  improvements  in  microprocessor 
price/performance  have  challenged  the  supercomputer  designer  to  achieve  high  levels  of 
system  performance  by  interconnecting  hundreds  or  even  thousands  of  processors.  But  to 
achieve  high  sustained  computational  rates  it  is  imperative  that  designs  be  balanced  with 
respect  to  processor  speed,  local  and  global  memory  bandwidth,  input/output  capability,  and 
interprocessor  synchronization  primitives.  This  talk  will  discuss  these  criteria  using  examples 
from  the  recently  announced  CRAY  T3D  system. 


9.45  a.m. 

WA2  INVITED  -  Free-Space  WDMA  Optical  Interconnects  using  Mesh-Connected  Bus 
Topology,  Y.  Li,  S.B.  Rao,  I.  Redmond,  T.  Wang  and  A.W.  Lohmann*,  NEC  Research  Institute, 
Princeton,  U.SA.  (* University  of  Erlangen,  Germany).  A  mesh-connected  bus  networking 
topology  is  proposed  for  implementing  the  three-stage  Clos  network  and  is  experimentally 
demonstrated  using  a  WDMA  technology. 


10.00  a.m. 

WA3  INVITED  Design  Issues  for  Free-Space  Photonic  Switching  Demonstrators,  F.B. 
McCormick,  AT&T  Bell  Labs.,  Naperville,  U.S.A..  We  discuss  the  issues  involved  in  building 
demonstration  systems  integrating  GaAs  FET-SEED  smart  pixels,  computer  generated 
holograms,  2-D  fibre  bundles,  high-power  lasers,  high  resolution  optics,  and  novel 
optomechanical  packaging.  A  prototype  5  stage,  32x16  fabric  operating  at  155  Mb/s  is 
described. 


10.30  •  11.00  Coffee  Break 


IX 


Wednesday  24  August,  11.00  - 12.30  p.m.  (Parallel  Session) 
Digital  Optics  Architectures 

( Session  Chair  Y.  Ichioka,  Osaka  Univ,  Japan) 


11.00  a.m. 

WB1  Demonstration  of  Optically  Controlled  Data  Switching  using  Quantum  Well  Bistable 

Devices  and  Modulatora,  P.  Koppa,  P.  Chavel,  J.L.  Oudar*,  R.  Kuszelewicz*.  J.Ph.  Schnell** 
and  J.P.  Pocholle**,  Inst.  rfOptique,  CNRS,  Orsay,  France,  * France  Telecom,  CNET,  Paris, 
France,  Thomson  CSF,  Orsay,  France).  Experimental  results  on  a  64  channel  free-space 
photonic  switching  system  are  presented.  Two  control  schemes  are  demonstrated:  direct 
optical  addressing  with  potential  signal  amplification  and  self-routing  operation  acting  on  data 
packets. 


11.15  a.m. 

WB2  Reconfigure ble  Architecture  Based  on  Selective  Enabling  of  Microlasers,  M. 

Murdocca,  J.  Battiato*,  0.  Berger**,  R.  Bussjager*  and  T.  Stonet,  Dept,  of  Comp.  Sci.,  Rutgers 
Univ.  New  Brunswick,  U.S.A.  (* Rome  Lab.,  Griffiss  AFB,  U.S.A.,  “Dept,  of  Comp.  Sci.,  Univ.  of 
California,  U.S.A.,  twavefront  Res.  Inc.,  PA,  U.S.A).  We  report  on  a  reconfigurable  architecture 
that  uses  a  static  free-space  optical  interconnect.  A  two-dimensional  array  of  microlasers 
controls  a  two-dimensional  array  of  S-SEED  optical  logic  modulators  in  a  cascaded  system. 


11.30  a.m. 

WB3  Optical  Array  Logic  Network  Architecture,  J.  Tanida  and  Y.  Ichioka,  Dept,  of  Applied 
Physics,  Osaka  Univ.,  Japan.  A  new  concept  called  optical  array  logic  network  architecture 
(OAL-NA)  is  proposed  for  effective  construction  of  optoelectronic  hybrid  computing  system. 
With  the  help  of  optical  array  logic  (OAL),  not  only  data  communication  but  also  global  data 
processing  are  implemented  in  the  interconnection  network. 


11.45  a.m. 

WB4  Cascaded  Optical  Data  Transfer  Through  a  Free  Specs  Optical  Perfect  Shuffle,  M.W. 
Derstine,  S.  Wakelin  and  K.K.  Chau,  Optivision  Inc.,  California,  U.S.A.  We  describe  the  design, 
fabrication  and  testing  of  an  optical  interconnect  system  constructed  from  a  two  dimensional 
array  of  symmetric  self-electro-optic  effect  devices  and  a  free  space  perfect  shuffle  module. 

12.00  noon 

WB5  Optical  Circuitry  for  Data  Transcription  and  Digital  Optical  Logic  Based  on 
Photothyristor  Differential  Pairs,  H.  Thienpont,  T.  Van  de  Velde,  A.  Kirk,  W.  Peiffer,  M.  Kuijk*, 
W.  Stevens,  J.  Fernandez,  I.  Veretennicoff,  R.  Vounckx,  P.  Heremans*  and  G.  Borghs*,  Applied 
Physics  Dept.,  Vrije  Univ.  Brussels,  Belgium  (*lnteruniv.  Micro  Electronics  Center,  Leuven, 
Belgium).  We  present  a  novel  technique  for  parallel  optical  data  transcription  and  digital  logic 
with  high  speed  differential  pairs  of  optical  thyristors  and  demonstrate  these  vital  operations  with 
compact  optical  hardware  circuitry. 

12.15  p.m. 

WB6  Optoelectronic  Multiport  Associative  Memory  for  Data  Flow  Computing 
Architecture,  V.B.  Fyodorov,  Russian  Academy  of  Sciences,  Moscow,  Russia.  New  optical 
setups  of  multiport  optical  associative  memory  are  suggested.  Such  a  memory  enables  M  users 
to  execute  a  simultaneous  and  independent  parallel  associative  data  search  and  retrieval  into 
memory  N  stored  words  by  M  search  arguments,  as  well  as  a  random-access  writing  of  keys 
and  data. 


W 
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Wednesday  24  August,  11.00  - 12. 30  p.m. 
(ParalM  Sssslon  •  Lscturs  Thsstrs  4) 
Analogue  Procsssing 

(Session  Chain  J.  Caulfield,  Alabama  A&M  Univ,  USA) 


11.00  a.m. 

WC1  An  Analog  Retina  for  Edge  Dataction,  C.  Wang  and  F.  Devos,  Inst.  d'Elec.  Fond.,  Univ. 
Paris  Sud,  Orsay,  France).  We  present  a  very  simple  optoelectronic  analog  retina  which  we 
constructed,  using  CMOS  technology.  The  experimental  results  indicate  that  the  system  is 
capable  of  acquiring  optical  data  of  an  image  and  detecting  the  edges. 


11.15  a.m. 

WC2  Theoretical  Results  on  Accuracy  Limitations  in  Analog  Optical  Processors,  D.A. 
Timucin,  J.F.  Walkup  and  T.F.  Krile,  Dept,  of  Electrical  Engineering,  Texas  Tech  Univ.,  U.S.A.  A 
complete  statistical  analysis  and  modelling  of  a  generic  three-plane  optical  processor  is 
presented.  Output  signal  statistics  are  determined  for  a  number  of  interesting  special  cases,  and 
the  fundamental  theoretical  accuracy  limitations  are  established. 

11.30  a.m. 

WC3  Time-Integrating  Correlation  Using  a  Fibre  Optic  Delay  Line  Proceesor,  R.A.  Athale 
and  G.W.  Euliss,  Dept,  of  Elec.  &  Comp.  Engineering,  George  Mason  Univ.  Virginia,  U.S.A.  A 
fibre  optic  tapped  delay  line  correlator  based  on  time-integration  is  proposed  and  demonstrated. 
Such  a  correlator  will  combine  high  bandwidth,  large  processing  gain  and  ability  to  compute  real¬ 
time  autocorrelation  functions. 


11.45  a.m. 

WC4  Experimental  Implementation  of  a  Joint  Transform  Correlator  Providing  Rotation 
Invariance,  L.  Bigue,  M.  Fracas'  and  P.  Ambs,  ESSAIM,  Univ.  de  Haute  Alsace,  France 
CONERA-CERT,  Toulouse,  France).  We  optically  experiment  a  joint  transform  correlator  (JTC) 
where  the  input  image  is  replaced  by  a  synthetic  discriminant  function  (SDF)  filter.  This  latter 
allows  us  to  ensure  a  practical  invariance  over  a  10°  range. 

12.00  noon 

WC5  Optimal  Nonlinear  Filtering  for  Pattern  Recognition  and  Optical  Implementation,  Ph. 

Refregier,  B.  Javidi*.  V.  Laude  and  J.-P.  Huignard,  Thomson-CSF,  Orsay,  France  (“Dept.  Elec. 
Eng.,  Univ.  of  Connecticut,  U.SA.).  An  optimal  processor  for  discrimination  and  noise 
robustness  is  developed  for  pattern  recognition.  It  leads  to  a  natural  and  attractive  high  speed 
optical  implementation.  Furthermore,  it  provides  theoretical  insight  in  previous  heuristic 
nonlinear  filtering  techniques. 

12.15  p.m. 

WC6  Rotation  Scale  and  Shift  Invariant  Real  Time  Pattern  Recognition,  E.  Silvers  and  J. 
Shamir,  Dept,  of  Electrical  Engineering,  Technion-lsrael  Institute  of  Technology,  Haifa,  Israel. 
An  adaptive  pattern  recognition  system  is  presented.  Invariance  is  achieved  by  using  two 
parallel  channels,  each  dealing  with  a  different  distortion.  The  overall  process  is  performed 
efficiently  and  can  be  executed  in  real  time. 


12.30-  2.00  p.m.  Lunch  Break 


Wednesday  24  August,  2.00  -  3.45  p.m. 

Optical  Intsrconnecta/lntsgration 

(Session  Chain  K-H.  Bremer,  Univ.  of  Erlangen,  German y) 


2.00  p.m. 

WD1  Systems  Partitioning  and  Placement  In  Optoelectronic  MCM  Design,  J.  Fan,  S.H.  Lee 
and  C.K.  Cheng,  Dept,  of  ECE,  Univ.  of  California,  U.S.A.  We  discuss  the  CAD  issues  for 
partitioning  between  electrical  and  optical  interconnects  and  physical  layout  in  f.ee-space 
optoelectronic  MCM  design.  The  results  of  a  design  example  are  also  presented. 

2.15  p.m. 

WD2  Architecture  of  a  Terabit  Free-Space  Photonic  Backplane,  T.  Szymanski  and  H.S. 
Hinton,  Dept,  of  Elec.  Eng.,  McGill  Univ.,  Montreal,  Canada.  The  architecture  of  a  'universal* 
photonic  backplane  is  described.  The  architecture  contains  a  large  array  of  'programmable' 
smart  pixels  which  can  be  configured  into  three  states.  By  setting  pixel  states  appropriately,  any 
network  can  be  embedded  into  the  backplane. 

2.30  p.m. 

WD3  A  Scalable  Optical  Interconnection  Network  for  Massively  Parallel  Computers,  A. 

Louri  and  H.  Sung,  Dept.  Elec.  &  Comp.  Eng.,  Univ.  of  Arizona,  U.S.A.  We  present  a  new 
optical  interconnection  network,  called  an  optical  Multi-Mesh  Hypercube,  which  is  both  size-  and 
generation-scalable,  ft  exhibits  such  properties  as  small  diameter,  fault  tolerance,  symmetry, 
constant  node  degree,  and  is  highly  amenable  to  optical  implementations. 

2.45  p.m. 

WD4  Demonstration  of  a  3D-lntegrated  Refractive  Microsystem,  J.  Moisei  and  K.-H. 
Brenner,  Inst,  fur  Angew.  Optik,  Univ.  Ertangen-Numberg,  Germany.  A  3D-integrated 
microsystem  is  presented  which  performs  the  overlay  of  data  planes.  Each  channel  of  the 
microsystem  consists  of  two  gradient-index  microlenses  and  two  microprisms.  The  output  plane 
is  500  pm  squared  ari  contains  approximately  100  pixels. 

3.00  p.m. 

WD5  Integration  of  Free-Space  Interconnects  using  Selfoc  Lenses:  Optical  Properties  of 
a  Basic  Unit,  K.  Hamanaka,  K.  Nakama,  D.  Arai,  Y.  Kusuda,  T.  Kishimoto  and  Y.  Mitsuhashi, 
Nippon  Sheet  Glass  Co.  Ltd.,  Ibaraki,  Japan.  A  vertical  and  horizontal  integration  technique  of 
tree-space  interconnects  using  Selfoc  lenses  is  described.  A  basic  unit  of  the  optical  mother 
board  has  been  fabricated  by  slicing  a  Selfoc  rod  of  4  mm  in  diameter. 


3.15  p.m. 

WD6  Fabrication  of  Fibre  Arrays  for  Optical  Computing  and  Switching  Systems,  J.M. 
Sasian,  R.A.  Novotny,  M.G.  Beckman,  S.L.  Walker,  M.J.  Wojcik  and  S.J.  Hintertong,  AT&T  Bell 
Labs.,  Naperville,  U.S.A.  We  describe  a  technique  for  assembling  fibre  arrays  as  needed  in 
optical  computing  and  photonic  switching.  A  4  x  8  array  was  manufactured  with  fibre  ends  to 
within  1 .5  pm  from  their  ideal  position  and  to  a  pointing  precision  of  30  arc-minutes. 


4.00  -  5.00  p.m.  Poster  Sessions 


Interconnects 

WP1  - 

WP26 

Memory  and  Neural  Networks 

WP28  - 

WP45 

SLMs  and  Smart  Pixels 

WP46  - 

WP56 

Optical  Switching 

WP57  - 

WP71 
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Thursday  25  August,  9.00  - 1090  a.m. 
Interconnection  in  Computers 

(Session  Chain  S.H.  Lee,  Univ.  of  California,  San  Diego,  USA) 


9.00  a.m. 

THAI  Experimental  Results  of  a  64  Channel,  Free  Specs  Optical  Interconnection  Network 

for  Massively  P^illei  Processing,  I.  Redmond  and  E.  Schenfeld,  NEC  Research  Inst., 
Princeton,  U.SA.  We  report  the  experimental  results  of  a  64  channel,  high  data  rate,  free-space 
interconnection  network  lor  massively  parallel  processing  architectures.  It  uses  VCSEL  arrays, 
photodetector  arrays  and  a  passive  optical  routing  network. 

9.15  a.m. 

ThA2  Performance  of  an  Optical  Free-Space  Crossbar,  H.J.  White,  G.M.  Proudley,  C.  Stace, 
N.A.  Brownjohn,  A.C.  Walker*,  M.R.  Taghizadeh*,  B.  Robertson*,  C.P.  Barrett*,  W.A. 
Crossland**,  J.R.  Brocldehurstt,  M.J.  Birch+,  M.  Snook+  and  D.  Vass+,  BAe  Sowerby  Res. 
Centre,  Bristol,  U.K.  CDept.  of  Physics,  Heriot-Watt  Univ.,  Edinburgh,  U.K.,  “Eng.  Dept.,  Univ. 
of  Cambridge,  U.K.,  tThom  EMI  CRL,  Middlesex,  U.K.  +Dept.  of  Physics.,  Edinburgh  Univ., 
U.K.).  Initial  results  on  the  performance  of  a  64  input  64  output  free-space  optical  crossbar  are 
reported.  The  components  of  the  compact  and  ruggedised  system  are  described. 


9.30  a.m. 

ThA3  A  FET-SEED  Based  Optical  Backplane  Demonstrator,  D.V.  Plant,  B.  Robertson,  H.S. 
Hinton,  W.M.  Robertson,  G.C.  Boisset,  N.H.  Kim,  Y.S.  Liu,  M.R.  Otazo,  A.Z.  Shang  and  L.  Sun, 
Dept,  of  Electrical  Eng.,  McGill  Univ.,  Montreal,  Canada.  We  demonstrate  a  representative 
portion  of  an  optical  backplane  using  free-space  optical  channels  to  interconnect  printed  circuit 
boards  which  employ  FET-SEED  based  smart  pixel  arrays.  Results  of  system  demonstrator 
performance  will  be  presented. 


9.45  a.m. 

ThA4  High  Speed  Parallel  Switching  of  Symmetric  Self-Electrooptic  Effect  Devices  (S- 
SEEDs),  D.  Goodwill,  D.A.  Baillie  and  F.A.P.  Tooley,  Dept,  of  Physics,  Heriot-Watt  Univ., 
Edinburgh,  U.K.  An  S-SEED  switching  experiment  has  been  designed  and  constructed  to 
investigate  operation  at  5-50  MHz  using  powers  of  1  mW/device  over  a  512  device  array. 
Details  of  the  implementation  and  experimental  results  will  be  presented. 


10.00  a.m. 

ThA5  Arrays  of  Field  Effect  Transistor-Self  Electrooptic  Effect  Device  (FET-SEED) 
Differential  Transimpedance  Amplifiers  for  Two  Dimensional  Optical  Data  Links,  R.A. 
Novotny,  M.J.  Wojcik,  A.L  Lentine,  L.M.F.  Chirovsky*.  LA.  D'Asaro*,  M.W.  Focht**,  G.  Guth**, 
K.C.  Glogovsky**,  R.  Leibenguth**,  M.T.  Asom**  and  J.M.  Freund**,  AT&T  Bell  Labs., 
NapenriHe,  U.S.A.  CAT&T  Bell  Labs.,  Murray  Hill,  U.S.A.,  “AT&T  Bell  Labs,  Breinigsville, 
U.S.A.).  Two  dimensional  (4x18)  arrays  of  Field  Effect  Transistor-Self  Electrooptic  Effect 
Device  tranaimpedance  receivers  have  been  fabricated  for  application  in  massively  parallel 
optical  data  links.  Up  to  lOOMbps/channel  was  demonstrated.  Test  results  are  discussed. 

10.15  am 

ThA6  INVITED  -  The  Optical  Computing  of  National  963  High  Technology  Program  in 
China,  Y.  Zhang,  Institute  of  Optoelectronic  &  Precision  Engineering,  Tianjin  University,  P.R. 
China  Some  of  the  achievements  of  National  863  High  Technology  Program  in  China  are 
summarised. 


10.30- 11.00  Coffee  Break 
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Thursday  25  August,  11.00  - 12.30  p.m. 

Digital  Optics  Implementations 

( Session  Chair  H.  S.  Hinton,  Univ.  of  Colorado,  USA) 

11.00  ajn. 

ThBI  INVITED  -  ParaUal  Opto alactr onto  Procsssing  Systsms  and  Appltoations,  M. 

Ishikawa,  Dept.  Math.  Eng.  &  Info.  Phys.,  Univ.  of  Tokyo,  Japan.  A  parallel  computing  system 
using  optical  interconnection  and  its  applications  for  visual  information  processing  are  shown. 
The  system  can  be  integrated  into  one  chip  by  using  VLSI  technology. 

11.30 

ThB2  INVITED  -  Free  Space  Holographically  Interconnected  Counter,  R.J.  Feuerstein,  D.C. 
O'Brien,  A.  Fedor,  M.C.  Chang  and  L.H.  Ji,  Optoelectronic  Computing  Systems  Center,  Univ.  of 
Colorado,  U.SA.  We  have  constructed  a  simple  4-bit  counter  using  holographic  interconnects, 
microlens  arrays,  vertical  cavity  lasers,  and  a  CMOS  detector  array  chip.  Performance  of  this 
prototype  system  will  be  discussed. 


11.45  a.m. 

ThB3  INVITED  -  Versatile  Compact  Image  Processor  with  Optical  Feedback  using 
Photopolymer  and  Ferroelectric  Liquid  Crystal  on  Amorphous  Silicon,  P.  Cambon*.  J. 
Sharpe  and  K.M.  Johnson,  Optoelectronic  Computing  Systems  Center,  Univ.  Colorado,  U.S.A. 
(‘GOSC,  Tdldcom  Bretagne,  Brest,  France).  An  optical  image  processor  with  optical  feedback 
and  gray  scale  capability  compactly  organised  around  bistable  binary  amorphous  silicon  and 
ferroelectric  liquid  crystal  devices  and  a  one  lens  correlator  using  reflective  multiplexed 
photopolymer  hologram  is  presented. 

12.00  noon 

ThB4  INVITED  -  Digital  Optical  Computing  Demonstration  Systems,  F.A.P.  Tooley,  S.M. 
Prince,  D.  Baillie,  D.  Goodwill,  M.  Desmulliez  and  M.R.  Taghizadeh,  Department  of  Physics, 
Heriot-Watt  University.  Edinburgh,  U.K.  Details  of  the  implementation  of  two  optical  computer 
demonstrators  will  be  presented:  a  64-channel  S-SEED,  cellular  logic  image  processor  with  a 
dynamic  interconnect  and  a  sorting  module  implemented  by  interconnecting  smart  pixels  with  a 
shuffle. 


12.15  pjn.  Closing  Remarks 


Pierre  Chavel 


Institut  d'Optique,  Paris,  France 
(ICO  representative) 


MAI/1 


Quantum  Well  Smart  Pixels  for  Optical  Switching  and  Processing 


David  A.  B.  Miller 
Rm.  48401,  AT&T  Bell  Laboratories, 

101  Crawfords  Comer  Road, 

Holmdel,  NJ  07733,  USA 

Abstract 

A  broad  variety  of  arrays  of  experimental  smart  circuits  with  quantum  well  optical  inputs  and 
outputs  has  been  fabricated  for  many  users.  Currant  status  and  future  prospects  for  such  large  scale 
optoelectronic  integration  are  summarized. 


Quantum  well  optical  modulator  diodes  have  proved  to  be  very  useful  devices  for  many 
optoelectronic  information  processing  and  switching  systems  experiments.  They  can  be  made  in  large 
numbers  (e.g.,  1000‘s)  with  usable  yields,  and  can  be  used  both  as  optical  input  devices  (since  they 
operate  as  good  photodiodes)  and  optical  output  devices  (operating  as  absorption  modulators). [1,2] 
The  devices  themselves  are  low  power,  high  speed,  and  apparently  reliable.  They  have  been  used  on 
their  own  in  various  configurations  to  perform  optoelectronic  logic  and  analog  optical  operations  in 
the  simpler  of  die  so-called  self-electrooptic-effect  devices  (SEEDs). 

The  diodes  have  also  been  successfully  integrated  with  electronic  devices  in  various  ways.  FET- 
SEEDs  are  monolithic  integrations  of  quantum  well  diodes  as  detectors  and  as  modulators  together 
with  GaAs  field  effect  transistors.[3]  They  have  also  been  successfully  grown  on  silicon  substrates,[4] 
and  they  have  recently  been  integrated  using  a  hybrid,  solder-bump  bonding  onto  silicon  integrated 
circuits.[5]  The  integration  with  electronics  brings  two  advantages:  (i)  the  optical  input  energy  needed 
can  be  reduced  by  the  use  of  electronic  gain  -  an  important  feature  since  the  performance  of  large 
systems  is  usually  limited  more  by  available  optical  power  rather  than  speed  limits  in  the  devices;  (ii) 
the  use  of  advanced  electronic  technologies  allows  much  more  complexity  in  the  "nodes"  or  "smart 
pixels"  -  empirically  it  now  appears  that  most  potential  applications  need  complexity  in  the  nodes  to 
perform  the  functions  that  the  systems  need. 

The  FET-SEED  technology  is  currently  the  most  advanced  for  practical  applications.  It  has  been 
run  in  a  5-stage  switching  system  [6]  at  system  speeds  of  155  Mb/s,  using  chips  with  400  transistors 
and  96  quantum  well  input  or  output  diodes.  Individual  circuits  have  been  run  at  650  MHz  or  with 
input  energies  as  low  as  20-30  0.  The  simpler  symmetric  SEED  (S-SEED)  technology  has  been  used 
in  arrays  of  2048  devices  in  multistage  system s  at  slower  speeds  (e.g.,  100  kHz)  with  up  to  60,000 
light  beams  in  the  system  overall.  The  various  integrations  on  silicon  circuits  are  still  at  an  earlier 
stage  of  device  research,  but  are  promising  for  integration  with  highly  complex  circuits;  arrays  of 
hybrid  devices  are  being  successfully  fabricated  now  on  silicon  circuits. 

In  1993,  an  experimental  workshop  was  run  in  conjunction  with  the  ARPA  CO-OP  program  in  the 
Unted  States  that  trained  users  how  to  design  in  the  PET-SEED  technology  and  fabricated  the  chips 
designed  by  the  users.  The  chips  were  delivered  to  users  in  early  1994.  One  goal  of  this  workshop  was 
to  stimulate  systems  and  applications  research  with  parallel  optoelectronic  technologies,  and  users 
designed  a  broad  variety  of  circuits  in  what  may  be  the  first  muhqmgect  optoelectronic  wafer 
fabrication.  The  results  of  this  workshop  we  currently  being  reviewed,  but  first  indications  are  that 
this  was  a  successful  project  and  that  there  is  demand  for  greater  availability  of  this  kind  of  service 
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for  system  research  purposes.  This  will  likely  stimulate  further  such  workshops  or  foundry  services 
for  optoelectronic  arrays  in  this  and  other  such  technologies. 
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Abstract 

We  describe  a  mew  process  to  integrate  LEDs  with  fully  processed  GaAs  circuits  using  MBE 
regrowtk.  The  bottom  contact  of  the  LED,  grown  in  the  dielectric  vias,  is  made  through  the  n-f 
source/drain  implant. 

In  this  paper,  we  describe  a  process  for  integrating  LEDs  (Light  Emitting  Diodes)  by  Molecular 
Beam  Epitaxial  regrowth  on  high  density  GaAs  MESFET  (Metal-Semiconductor  Field-Effect  Transistor) 
circuits  fabricated  by  Vitesse  Semiconductor  Corp.  through  the  U.S.  chip  prototyping  service,  MOSIS 
(MOS  Integration  Service).  The  Vitesse  process  available  through  MOSIS  offers  enhancement-mode 
(Vt=0.25V)  and  depletion-mode  (Vt=-0.6V)  GaAs  MESFETs  (the  minimum  gate  length  is  0.8pm), 
optical  FET  and  MSM  (Metal-Semiconductor-Metal)  photodetectors,  Schottky  diodes,  and  four  levels  of 
aluminum  interconnect  metalization.  Each  layer  of  interconnect  metal  is  separated  by  a  layer  of  SiOj 
dielectric.  The  total  thickness  of  the  dielectric  stack  is  4pm.  The  key  features  which  allow  the  circuits  to 
withstand  (Al,Ga)As  MBE  (Molecular  Beam  Epitaxy)  regrowth  are  that  all  of  the  metalization  layers, 
i.e.  the  gate-metal,  the  source-drain  ohmic  contact,  and  the  interconnect  metal,  are  stable  up  to  500* C 
and  none  of  the  layers  contain  gold  which  would  degrade  the  device  performance. 

The  chip  layout  for  the  optoelectronic  circuits  is  done  using  standard  CAD  tools.  Figure  1  shows 
the  cross-section  of  the  Vitesse  chip  with  the  LED  epitaxial  layers  regrown  in  the  dielectric  vias.  In  the 
areas  where  the  LED  material  is  later  grown,  an  n+  source/drain  implant  is  specified  along  with  two 
overlapping  dielectric  etch  layers,  to  specify  the  area  where  the  dielectric  covering  the  substrate  should  be 
removed.  The  dielectric  vias  for  the  LEDs  have  been  made  as  small  as  10pm  x  10pm  with  vertical  ride 
walls.  The  n-f  source/drain  ion-implant  under  the  LED  provides  the  bottom  n  contact  for  the  LED  so  that 
one  terminal  of  the  LED  can  be  directly  connected  to  the  rest  of  the  circuit  without  any  post-processing. 
After  receiving  the  chip  from  MOSIS  with  the  MESFETs,  interconnect  circuitry,  and  the  dielectric  vias 
for  the  LEDs,  the  first  step  is  to  degrease  the  wafer  and  clean  the  GaAs  surface  with  a  buffered  oxide  etch. 
The  chip  is  then  loaded  into  an  MBE  chamber  and  a  double  heterojunction  GaAs/AlGaAs  LED  structure 
is  grown  at  530*C.  The  total  thickness  is  4pm  so  that  the  top  p-f  GaAs  layer  is  planar  with  dielectric 
stack.  The  superlattice  at  the  bottom  helps  impede  defects  from  the  substrate  surface  from  propagating 
upwards.  After  unloading  the  chip  from  the  MBE  chamber,  the  single  crystalline  LED  material  in  the 
vias  is  masked  off  with  photoresist  so  that  the  polycrystalline  GaAs  material,  which  forms  over  the  rest  of 
the  chip,  can  be  removed  using  a  wet  chemical  etch.  The  final  processing  step  is  deposition  of  AuZn/Au 
for  the  top  pH-  contact. 

Figure  2  is  a  photograph  of  part  of  a  GaAs  MOSIS  chip  after  the  MBE  regrowth.  The  circuit 
surrounded  by  the  7  electrical  pads  is  a  3  unit  winner- take-all  circuit.  The  function  of  the  winner-take-all 
circuit  is  to  determine  the  unit  with  the  largest  input  signal.  The  LEDs,  located  to  the  right  of  the 
initials  “AG  JL  dP",  are  the  output  signals  for  the  circuit.  The  dimensions  of  the  dielectric  vias  for  the 
LEDs  are  40pm  x  40pm.  The  only  LED  which  will  be  on  will  be  the  one  with  the  largest  optical  input 
power.  Optical  FETs  w ere  used  in  this  circuit  as  photodetectors  because  of  their  high  responrivity  (on 
the  order  of  1000A/W  at  lOnW  input  power  levels).  Figure  3  shows  the  output  power  of  two  competing 
units  of  the  winner-take-all  circuit.  The  optical  power  on  unit  1  was  fixed  at  30nW  while  the  power  on 
unit  2  varied  from  0  to  lOOnW.  There  was  no  input  to  the  third  unit  so  consequently  its  output  remained 
off.  The  transconductance  of  the  enhancement-mode  MESFET  after  regrowth  was  30m S/ mm  and  the 
gate  length  was  1pm.  The  efficiency  of  the  LEDs  was  5  x  10~4W/A,  which  is  the  one  of  the  reasons  for 
the  low  output  power  in  the  circuit.  Another  reason  is  barkgating  on  the  chip  which  reduces  the  output 
current  and  responrivity  of  the  detectors.  Improvements  are  expected  with  refined  LED  regrowth  and 
processing  techniques. 


Figure  1  Cross-section  of  &  Vitesse/ MOSIS  chip  Figure  3  Output  Power  of  Two  Competing 

with  LED  grown  in  the  dielectric  via.  Branches  of  a  Winner-Take- All  Circuit 


Figure  2  Photograph  of  a  Vitesse/MOSIS  chip  with  the  LEDs  grown  in  the  dielectric  vias. 
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Abstract  Fine  grain  parallel  optoelectronic  processors  for  dedicated  tasks  may 
deserve  further  consideration.  The  main  case  is  vision  machines  ;  stochastic 
algorithms  working  on  images  considered  as  Markov  Fields  are  one  possible 
approach  to  deal  with  real  images. 

Introduction 

The  processing  of  real  images  is  one  area  where  optical  computing  may  come  up  with 
competitive  solutions.  The  basic  reason,  as  usual,  is  the  number  of  interconnects  that  are 
needed  before  any  sensible  processing  task  on  an  image  is  completed  ;  the  number  of 
interconnects  required  is  particularly  large  in  parallel  implementations,  which  is  where  free 
space  optics  is  attractive.  The  word  interconnect  should  be  understood  in  a  broad  sense, 
including  data  input  and  output  and  the  provision  of  various  control  signals  to  the  processing 
elements  as  well  as  exchange  of  information  among  the  processors  themselves.  We  are 
interested  in  "optical  scale"  parallelism,  where  the  processor  consists  in  a  number  of 
processing  elements  (PE's)  equal  to  the  number  of  pixels  in  the  image,  typically  at  least  10* : 
this  puts  the  heaviest  weight  on  interconnects  but  alleviates  considerably  programme  and 
data  transfer  control.  For  easy  image  format  input,  all  PE's  should  fit  together  on  a  chip  a 
few  square  centimetres  on  a  side.  But  technology  will  allow  to  integrate  only  fairly  weak 
PE's  on  such  a  small  area.  We  are  then  faced  with  a  double  challenge :  devise  optoelectronic 
architectures  that  make  the  best  use  of  optical  interconnects  to  maximise  the  computing 
power  and  find  algorithms  that  can  use  it  for  meaningful  image  processing  tasks. 

Convolutions 

The  simplest  and  most  famous  application  Of  the  above  concept  is  the  optical  convolution, 
where  the  PE  reduce  to  photodetectors  and  where  weighted  optical  interconnects  that  define 
the  impulse  response  do  all  the  processing.  The  main  application  is  pattern  recognition.  The 
double  challenge  then  takes  on  the  following  form :  can  convolution  be  helpful  in  real 
pattern  recognition  problems  and  if  yes,  can  optics  implement  such  convolutions  ?  Progress 
cm  filter  reprogrammabili ty,  adaptivity  to  the  input  signal  and  invariances  will  be  reported  in 
several  papers  in  this  conferences. 

Optical  Cellular  Automata : 

Convolution  nevertheless  shows  only  limited  generality  and  it  is  important  to  seek  broader 
classes  of  our  image  processors.  The  next  simple  case  is  cellular  automata,  that  have  been 
investigated  in  some  detail  for  some  years  under  various  names,  including  symbolic 
substitution  and  mathematical  morphology.  Their  operation  cycle  consists  in  the 
combination  of  one  convolution  and  one  point  nonlinear  operation.  The  main  role  of  optics 
here  is  to  implement  the  convolution  part,  while  optoelectronic  or  nonlinear  optical  devices 
located  at  every  pixel  respond  nonlinearly  to  its  result.  The  invited  communication  by 
Casasent  in  this  conference  develops  mathematical  morphology  applications  for  a  number  of 
image  processing  tasks. 

Optimisation  problems  in  image  processing 

One  further  step  is  to  introduce  optimisation  problems  into  the  realm  of  optical  computing. 
In  an  optimisation  problem,  an  energy  function  £(2}  is  introduced  as  a  measure  of  the 
departure  of  an  image  2  from  an  ideal  goal  to  be  reached  by  the  processing.  The  definition 
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of  function  E  incorporates  all  relevant  knowledge,  i.e.  the  input  data  but  also,  for  example, 
the  sources  of  degradation  to  be  removed,  a  priori  information  on  the  class  of  object,  and  the 
features  of  interest.  Tire  processing  minimises  the  energy  with  respect  to  x . 

Previous  algorithmic  work,  notably  by  Geman  et  al.  who  used  the  approach  of 
Markov  random  fields  to  statistically  describe  the  space  of  "real  images",  has  demonstrated 
energy  functions  that  can  detect,  for  example,  texture,  edge  or  motion  in  fairly  difficult 
situations.  However,  the  computational  load  is  usually  extremely  heavy  because  a  small 
change  in  the  image  can  generate  a  large  change  in  the  energy  -  the  "energy  landscape"  is 
said  to  be  wild  •  so  that  secondary  minima  will  prevent  simple  gradient  descent  algorithms 
from  reaching  the  desired  minimum  or  even  an  acceptable  suboptima]  solution.  As  a 
consequence,  it  is  not  only  impossible  in  practice  to  find  the  absolute  minimum  through 
exhaustive  search  in  the  [x]  space,  but  even  the  efficient  suboptimal  procedures  like 
simulated  annealing  are  hardly  practicable  unless  some  way  can  be  found  to  implement  them 
in  parallel :  as  we  shall  illustrate  now,  their  parallelisation  is  where,  in  our  opinion,  optics 
may  have  a  new  role  to  play. 


Optics  for  simulated  annealing 


In  the  simplest  version  of  simulated  annealing,  the  energy  is  first  differentiated  with  respect 
to  some  variable  ;  here,  the  variable  are  the  grey  values  Xjj  of  the  pixels.  If  the  result  of 

changing  Xjj  to  Xij  +  6x  is  an  energy  increment  AE,  a  random  test  is  made  so  that  the  change 
is  accepted  with  a  probability  of  order  exp^— — j ,  where  T  is  an  simulated  "temperature". 


The  process  is  then  iterated  and  the  temperature  is  decreased  at  some  appropriate  slow  pace. 
Optical  computing  can  provide  three  functions. 

Firstly,  energy  functions  can  at  least  sometimes  be  constrained  to  quadratic  variations 


in  the  Xjj.  AE  is  then  a  convolution  of  the  image,  and  we  are  back  to  the  first  section. 

Secondly,  the  parallel  generation  of  a  large  amount  of  random  numbers  of  a  good 
statistical  quality  can  be  best  handled  by  a  physical  rather  than  digital  approach,  and  we  have 
demonstrated  the  use  of  speckle  to  provide  around  10 10  random  numbers  per  second  and 
project  them  onto  a  photodetector  array,  such  as  an  array  of  "smart  pixels".  In  particular,  we 
have  shown  that  speckle  statistics  can  be  easily  moulded  into  exactly  the  required  form  of 
probability  law,  and  that  the  simulated  temperature  can  be  controlled  directly  by  the  average 
speckle  brightness,  i.e.  the  laser  power. 

Finally,  novel  optoelectronic  or  nonlinear  optical  arrays  such  as  SEEDs  or  pnpn 
photothyristors  may  be  used  to  make  the  required  decision. 

For  the  simplest  forms  of  the  energy  function,  it  may  then  be  possible  to  devise  an 
"all-optical"  solution.  In  more  elaborate  cases,  integrated  circuits  will  be  required  to  evaluate 

AE  but  the  role  of  optics  to  provide  the  random  numbers  and  possibly  the  convolution  part 


of  AE  is  still  an  important  factor  to  open  the  way  to  compact,  massively  parallel  integration 
of  image  processors  for  such  algorithms. 


Conclusion 

The  favourite  operation  of  analogue  optical  processing,  convolution,  may  provide  a  solution 
for  a  certain  number  of  image  processing  problems.  But  we  believe  that  it  could  well  be 
combined  with  other  readily  available  optical  functions  and  with  integrated  circuit 
microtechnology  into  video-real-time  systems  for  a  significantly  wider  class  of  vision 
problems. 

We  acknowledge  the  contributions  of  F.  Devos,  P.  Garda,  G.  Prtmont  D.  Provost  and 
J.C.  Rodier  to  the  research  leading  to  this  work. 
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Abstract .  Quantun  effects  are  shown  to  limit  the  information  capacity  of 

12 

electronic  channels  by  ~  10  nit/s.  At  higher  rates  the  energy  price  for 

transmitting  information  is  2-4  order  less  in  lD-boson  channels  as 

compared  with  lD-fermlon  ones.  The  3D-optical  channels  are  free  of  quantum 

19  2 

restrictions  up  to  information  rates  ~  10  nit/cm  s. 

Because  bosons  and  fermions  obey  the  different  statistics,  the  behaviors 
of  the  electronic  and  photonic  information  channels  are  expected  to  be 
essentially  different  in  the  cases  when  quantum-statistical  effects  take 
place.  With  use  of  the  Fermi-Dirac  distribution  function  we  have  derived 
the  expressions  for  calculating  the  fermion  information  channel  capacity 
and  minimum  energy  required  for  transmitting  one  nit  of  information.  The 
numerical  estimates  are  compared  with  those  of  Lebedev  and  Levitin  [1]  for 
lD-boson  channels. 


Flg.l.  Information  channel  properties  at  300  K.  Indexes  b  and  f  correspond 
to  lD-boson(photon)  and  lD-fermion (electron)  channels  respectively. 
(Pj  -  PR)  equals  the  boson  signal  power  and  gives  the  minimum 

estimate  for  the  fermion  signal  power  because  of  the  statistical 
dependence  of  signal  and  noise  fermions. 
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For  both  types  of  channels  the  quantua-statlstical  effects  play  an 

important  but  quantitatively  different  role  when  the  power/energy  of 

signals  is  effectively  presented  by  the  Fourier  components  with 

frequencies  higher  than  the  boundary  thermal  noise  frequency  nnoise~k^0/h- 

It  is  seen  from  Fig.l,  that  for  ID- channel  at  room  temperatures  this 

corresponds  to  the  information  transmitting/processing  rates  of  about 
12 

10  nit/s.  Below  these  rates  both  types  of  channels  behave  equally  in 

agreement  with  the  classical  Shennon’s  theory. 

13 

At  the  information  rates  >  10  nit/s  the  difference  in  behaviour  of 
fermion  and  lD-boson  channels  increases  so  much  that  the  energy  price 
transmitting  the  same  information  becomes  3-4  order  of  magnitude 
gi  er  for  fermion  channels  as  compared  with  photon  ones.  It  should  be 
noted  that  transmitting  and  processing  data  will  result  in  "nonlinear" 
distortions  and  losses  of  information  at  the  conditions  when  Information 
channel  capacities  are  no  longer  the  linear  functions  of  the  input  signal 
powers . 

Thus,  the  analysis  of  quantum-statistical  restrictions  in  the 

information  channels  allows  us  to  consider  the  value  of  information 

12 

transmitting/processing  rate  of  about  10  nlt/s  as  the  fundamental  limit 
for  electronic  systems  (which  are  lD-systems  by  the  nature). 

For  3D-photon  channels  a  generalization  of  the  Lebedev’s  and  Levitin’s 
formula  [lj  for  the  information  channel  capacity  is  obtained  and  it  is 
shown  (see.  Fig. 2)  that  such  channels  are  free  of  quantum-statistical 


Signal  intensity,  W/cm2  Information  rate,  nit/cm2s 

Fig. 2.  3D-boson (photon)  information  channel  properties.  Dashed  lines  in 
(a)  show  the  classical  Shennon’s  asymptotes. 

19  2 

restrictions  up  to  information  transmitting  rates  of  about  10  nit/cm  s. 
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Abstract.  Spatial  light  modulator  based  routing  systems  offer  potential  advantages  over 
conventional  electronic  and  waveguide-based  systems.  Integration  of  ferroelectric  liquid  crystal 
over  silicon  smart  pixel  modulators  permits  high  levels  of  switch  parallelism  to  be  achieved. 

1.  Introduction 

Free-space  optica]  interconnection  enables  much  greater  fan-out  and  fan-in  than  can  be  accessed 
by  electronics.  These  features  make  it  appropriate  to  re-examine  single-stage  crossbar  archi¬ 
tectures.  The  purpose  of  this  review  is  to  summarise  the  technology  of  spatial  light  modulator 
(SLM)  routing  systems  and  to  consider  control  and  arbitration  issues. 

2.  Device  technology 

The  modulators  in  the  systems  proposed  here  are  intended  to  be  silicon  integrated  circuits 
with  an  overlay  of  chiral  smectic  liquid  crystal  [1].  The  very  large  electro-optic  effects  that 
are  observed  enable  high  fan-out /fan-in  switch  structures  to  be  constructed,  and  the  electronic 
functionality  of  the  silicon  backplane,  capable  of  electronically  controlled  mirrors  and  sensitive 
photodetectors,  forms  a  powerful  ‘smart  pixel’  technology. 

2.1.  Matrix-matrix  crossbar 

An  optical  vector-matrix  processor  for  fast  Fourier  transform  calculations  was  proposed  in 
1978  [2],  later  identified  as  a  single-stage,  non-blocking  space-switch,  and  then  extended  to 
the  general  matrix-matrix  crossbar  [3].  These  architectures  passively  fan-out  each  optical  input 
towards  every  output.  The  replications  are  then  ‘shadowed’  by  means  of  a  reconfigurable  shutter 
array  and  hence  selective  fan-in  is  achieved  onto  the  output  plane,  figure  1. 

Electrically  addressed  SLMs  operating  in  a  binary  transmission  mode  are  suitable  high¬ 
speed  switching  arrays  that  can  be  used  as  the  shutter  plane.  Each  input  replication  must 
be  optically  resolved  through  a  single  shutter  such  that  any  arbitrary  interconnection  pattern 
may  be  formed,  including  broadcast,  multicast  and  multiple-input  fan-in.  Reconfiguration  of 
the  switch  simply  involves  closing  any  (single)  shutters  corresponding  to  completed  calls  and 
opening  any  new  paths  required. 

2.2.  Dynamic  holographic  crossbar 

The  generation  and  use  of  computer-generated  holograms  is  well  documented,  e.g.,  [4].  The 
principle  of  operation  of  this  architecture  is  the  use  of  holograms  to  deflect  as  much  optical 
power  as  possible,  hence  eliminating  the  need  for  the  initial  fan-out  operation  associated  with 
the  matrix-matrix  architecture.  Each  routing  area  is  filled  with  one  from  a  set  of  base  holo¬ 
grams  which  are  stored  in  a  non-volatile  memory  behind  the  interconnect  plane,  figure  S.  Each 
hologram  acts  as  an  independent  diffraction  grating  and  also  provide  broadcast,  multicast  and 
fan-in  capabilities  [5]. 

Electrically  addressed  SLMs  operating  in  a  binary  phase  mode  produce  a  more  efficient 
system  than  amplitude  mode  devices,  but  both  techniques  lead  to  a  redundant  symmetry  in  the 
output  plane.  A  means  of  breaking  this  symmetry  without  significantly  increasing  the  system 
complexity  or  cost  is  proposed,  [6].  Control  of  this  switch  involves  periodically  transferring  im- 
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ages  from  memory  onto  the  SLM  display.  A  key  scaling  issue  to  be  addressed  is  the  number  of 
pixels  that  are  required  per  routing  hologram. 


Shutters 
(Not  At*  shown) 

Figure  1:  Matrix-matrix  crossbar  concept. 


Figure  2:  Holographic  crossbar. 


S.  Data  throughput,  switch  control  and  arbitration  ^ 

The  routing  algorithms  for  non-blocking  single-stage  switches  are  very  simple,  all  paths  through 
the  switch  being  mutually  independent.  The  control  information  may  be  embedded  in  the  main 
data  paths  using  the  existing  free-space  optics,  or  it  may  be  separate  from  them. 

The  detection  of  contention  external  to  the  switch  and  the  taking  of  the  steps  necessary 
to  stop  multiple  channels  being  directed  to  the  same  port  are  also  considerably  simplified  in  ( 

these  structures.  Facilities  for  resolving  contention  can  be  built  directly  into  the  switch  control 
electronics  on  the  silicon  backplane  of  the  interconnect  SLM. 


4.  Conclusions 

The  data  throughput  of  optically  transparent  fibre  optic  switches  is  determined  by  the  optical 

power  budget  and  can  be  very  high.  The  use  of  smart  pixel  systems  therefore  provides  a  viable  * 

switch  technology.  Results  to  be  presented  suggest  that  the  matrix-matrix  crossbar  is  not  as 

critically  dependent  on  SLM  contrast  ratio  as  is  normally  assumed.  For  both  architectures,  a 

significantly  lower  signal-to-noise  ratio  may  still  produce  acceptable  performance  because  of  the 

single-stage  nature  of  the  switches.  The  holographic  crossbar  typically  exhibits  higher  SNR  and 

lower  loss  than  the  matrix-matrix  architecture.  < 
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Abstract 

The  fabrication  of  very  uniform,  planar  10  x  10  vertical-cavity  surface-emitting  laser  diode 
arrays  is  reported.  Individual  elements  show  threshold  currents  around  4  mA,  emit  up  to  300 
/rW  single  mode,  and  exhibit  a  3  dB  modulation  bandwidth  of  4.6  GHz. 


Independently  addressable,  two  dimensional  laser  diode  arrays  have  many  applications  for 
example  in  optical  scanners,  display  technology,  optical  interconnects,  high  capacity  switching 
systems,  wavelength  division  multiplexing  systems,  or  for  the  generation  of  high  power  coherent 
beams  [1],  Using  planar  vertical-cavity  surface-emitting  lasers  (VCSEL),  such  arrays  with  high 
packing  densities  are  relatively  easy  to  fabricate  [2].  VCSELs  allow  single  longitudinal  mode 
oscillation,  very  low  threshold  currents,  and  alignment  tolerant  and  efficient  coupling  into  single 
mode  fibers  [3]. 

VVe  have  fabricated  10  x  10  independently  addressable  VCSEL  arrays.  The  lasers  were 
grown  by  molecular  beam  epitaxy.  The  active  region  contains  3  InGaAs  quantum  wells  and  is 
embedded  between  AlAs-GaAs  Bragg  reflectors.  Lateral  current  confinement  and  insulation  of 
the  individual  lasers  is  achieved  by  proton  implantation  with  an  energy  of  300  keV  and  a  dose 
of  5  •  1014  cm-2.  The  reflectivity  of  the  upper  Bragg  reflector  is  improved  with  an  additional 
Au  layer  on  top.  TiAu  rings  form  the  ohmic  p-contacts.  The  100  elements  are  connected  to  25 
bonding  pads  at  either  side  of  the  5x5  mm  array  chip  by  metallic  TiAu  feeding  lines  which  are 
deposited  on  an  insulating  SiO  layer.  A  schematic  of  the  vertical  structure  and  a  top  view  of  a 
full  chip  is  depicted  in  Fig.  1.  Individual  lasers  have  active  diameters  of  12  fim  and  the  pitch 
size  of  the  array  is  250  /rm. 

The  processed  arrays  show  a  very  good  homogeneity  of  the  output  characteristics  over  the 
whole  array  size.  A  typical  threshold  current  distribution  is  shown  in  Fig.  2.  Almost  all  lasers 
exhibit  threshold  currents  between  4.1  mA  and  4.7  mA  with  an  average  value  of  4.4  mA  .  Each 
laser  has  a  maximum  single-mode  output  power  in  the  order  of  300  jiW.  Above  this  value  higher 
order  transverse  modes  appear  in  the  emission.  By  proper  wedge  shaping  of  the  epitaxial  layers 
during  growth,  a  variation  of  the  emission  wavelengths,  plotted  in  Fig.  3,  was  achieved,  making 
the  array  very  well  suited  for  wavelength  division  multiplexing  systems. 

The  small-signal  response  of  a  single  bonded  lasing  element  at  different  driving  currents 
above  threshold  is  shown  in  Fig.  4.  A  3  dB  modulation  bandwidth  of  4.6  GHz  is  measured  for  a 
driving  current  of  7.3  mA,  corresponding  to  an  output  power  level  of  240  /iW.  This  experiment, 
performed  with  a  not  yet  optimized  laser  structure,  shows  the  potential  of  the  array  for  high 
bit  rate  data  transmission.  Operating  all  100  elements  in  parallel,  bit  rates  of  several  hundred 
Gbit/s  can  be  obtained.  Using  laser  arrays  mounted  on  proper  heat  sinks  with  lower  series 
resistances  we  expect  still  much  higher  output  powers  [4]  and  modulation  bandwidths. 
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Fig.  1:  Vertical  structure  and  top  view  of  an  VCSEL  array  fabricated. 


Fig.  4:  Small-signal 

modulation  characteristics. 
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Abstract  16x16  arrays  of  stnined-InGaAaA3aAs  S-SEEDs  have  been  tuccettfully  operated  with  a  diode- 
pumped  1064  nm  laser.  These,  and  other  GaA^QaAlAs  devices,  are  being  incorporated  into  digital  parallel 
optical  processing  demonstrators. 

The  Scottish  Collaborative  Initiative  on  Optoelectronic  Sciences  (SCIOS)  is  a  partnership  between 
Heriot-Watt,  Glasgow,  St  Andrews  and  Edinburgh  Universities  that  has  been  running  since  1990.  It  has  as  its 
objectives: 

(i)  to  pursue  research  into  the  physics,  materials  science,  fabrication  and  packaging  technologies 

underlying  the  development  of  optoelectronic  devices,  and 

(ii)  to  combine  the  results  of  (i)  with  new  system  architecture  concepts  so  as  to  realise  free-space  parallel 

optical  computing  and  information  processing  demonstrators  experimentally. 

The  SCIOS  technology  programme  includes  work  on  m-V  semiconductor  devices,  diffractive  optics,  spatial 
light  modulators  and  sotid-saue  lasers.  Recent  emphasis  has  been  on  smart  pixel  arrays  including  hybrid 
silicon/InGaAs  devices.  In  addition,  new  digital  free-space  system  architectures  are  being  explored  and  a 
range  of  novel  demonstrators  constructed. 

SEED  arrays  [1]  operating  at  850  nm  are  proving  to  be  effective  as  digital  photonic  logic  planes  in 
experiments  on  parallel  optical  processing  and  switching  systems  [2,3].  However,  system  speed  can  be  limited 
by  the  degradation  of  SEED  performance  at  the  required  high  optical  power  [4,5]  due  id  local  heating, 
saturation  and  field  screening.  Woodward  et  al  [6,7]  showed  that  both  the  Joule  heating  associated  with  the 
photoresponse  and  the  degradation  in  performance  at  high  inadiance  could  be  avoided  by  using  material  with 
a  radically  decreased  non-radiative  earner  lifetime  t  ;n  the  electroabsorptive  region.  However,  the  ion- 
impl&ntatioo  method,  used  in  Refs  6  and  7,  broadens  the  exciton  absorption  peak  and  reduces  the  modulation 
performance.  We  have  made  MQW  modulator  structures  [8]  in  which  the  carrier  lifetime  is  already  short  in 
the  as-grown  structure,  so  that  the  finished  device  had  both  tow  photocurrent  and  good  modulation 
performance.  We  have  gone  on  to  demonstrate  bow  such  MQW  material  may  be  used  in  a  novel 
electroabsorptive  optical  switching  device. 

This  device  is  based  an  vertically  integrated  modulator  and  detector  elements,  connected  electrically 
in  parallel.  The  non-QW  GaAlAs  detector  (lower  part)  has  unity  quantum  efficiency  under  reverse  bias.  The 
modulator  diode  (top  pan)  consists  of  deep  GaAs/AlAs  quantum  wells  and  has  extremely  low  quantum 
efficiency  (q=0-4%  at  15V,  q=1.4%  at  25V)  due  to  the  long  sweep-out  time  across  the  high  barriers,  coupled 
with  a  short  non-radiative  lifetime  t.  In  this  case,  the  low  value  of  ?  was,  we  behove,  produced  by  a 
deterioration  in  die  MBE  system  at  the  time  of  growth,  which  fortunately  did  not  broaden  the  exciton 
absorption  feature.  We  hope  to  reproduce  this  effect  controllably  by  growing  at  much  tower  temperatures. 

At  851  nm,  a  combination  of  the  quantum  confined  Stark  effect  in  the  MQW  modulator  and  die 
Franz-Keldysh  effect  in  the  detector  combine  to  give  N-type  negative  resistance,  tending  to  a  factor  of  3 
decrease  in  the  responsivity  of  the  whole  structure  between  OV  and  15V  for  10|iW  incident  optical  power.  This 
behaviour  is  maintained  at  intensities  8  times  higher  than  for  a  conventional  MQW  SEED  [4,5].  To  show  how 
this  structure  can  be  used  to  make  a  high  performance  reflection-mode  SEED- type  switch,  we  have  recently 
made  devices  with  optical  output  provided  by  a  partial  mirror  (70%  reflectivity)  between  die  modulator  and 
detector.  A  contrast  ratio  of  4  and  a  high  state  reflectivity  of  40%  are  expected.  Cootrol  of  photo-response  is 
likely  to  be  an  important  factor  in  the  design  of  future  smart-pixel  devices  based  on  semiconductor  modulators. 

The  clock-rates  for  demonstration  parallel  digital  optical  processing  systems  based  on  current- 
generation  SEED  arrays,  operating  at  850  nm  wavelength,  are  limited  by  the  diode  laser  power  reaching  the 
device  array  (milliwatts).  The  consequent  demand  for  a  better  match  to  higher-power  (>  1  watt)  lasers,  such  as 
diode-pumped  NrhYLF  and  NtfcYAG,  has  prompted  die  development  of  strained  InGaAs/GaAs  multiple 
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quantum  wells  (MQW)  grown  by  molecular  beam  epitaxy  (MBE)  for  SEED*  operating  at  1047-1064  tun. 
Previously  reported  devices  in  this  materials  system  [9.10,11]  had  insufficient  contrast  mtio  and  too  high 
operating  vohage  for  use  in  switching  systems.  We  have  obtained  enhanced  device  performance,  compared  to 
our  previously  reported  device  [11],  by  unproved  growth  techniques,  ami  have  demonstrated  for  the  first  time 
operation  of  arrays  of  SEEDs  at  1064  nm. 

In  our  S-SEED,  the  pin  MQW  modulator  is  grown  strain-balanced  to  a  deliberately  relaxed  buffer 
layer  of  intermediate  lattice  constant,  such  that  the  InGaAs  wells  are  in  compression  ami  balance  the  OaAs 
barriers  in  tension.  Growth  was  by  MBE  on  a  [100]  orientated  semi-insulating  QaAs  substrate.  To  minimise 
the  gliding  of  dislocations,  and  hence  avoid  broadening  of  the  excitonic  absorption  peak,  a  low  temperature 
was  used  with  a  low  arsenic:group-in  flux  ratio  to  keep  the  mobility  of  the  group- IH  atoms  high.  The  buffer 
layer  consisted  of  an  i-InGaAs  layer  with  10-step  grading  of  In  composition  from  0  to  13.5%.  followed  by  an  i- 
InAlAs/GaAs  superlattice.  The  structure  was  designed  to  be  completely  relaxed  at  the  top  of  the  graded 
InGaAs  layer,  but  X-ray  diffraction  measurements  showed  it  was  only  71%  relaxed.  The  room  temperature 
absorption  peak  occurred  at  1061  nm  rather  than  the  design  wavelength  of  1047  nm.  However,  the  variation  in 
this  wavelength  was  only  ±0.2  nm  across  the  central  1*  of  foe  2"  diameter  substrate.  The  half-width  half 
maximum  on  the  low  energy  side  of  this  peak  was  6.0  meV  (5.4nm),  which  is  the  best  reported  for  this  system. 
It  is  comparable  to  foe  figures  of  6.8  meV  for  MOCVD -grown  InGaAs/GaAsP  [12]  and  5.25  meV  for  gas- 
source  MBE-grown  InGaAs/InGaP  [13],  for  both  of  which  foe  MQWs  are  strain-balanced  to  foe  substrate.  It 
also  compares  well  to  4.5  meV  for  similar  GaAs/GaALAs  MQWs  [1]. 

For  initial  optical  testing  with  a  tunable  laser,  200  pm  diameter  structures  were  fabricated  by  wet 
etching  and  metallisation.  The  front  of  foe  sample  had  a  partial  anti-reflection  coating,  the  bade  was  polished 
but  uncoated.  The  photocurrent  characteristics  showed  98%  quantum  efficiency  at  0V,  100%  at  IV  reverse 
bias.  The  transmission-voltage  performance  shows  a  contrast  ratio  of  2.0  between  0  and  10V  at  1060  nm. 
which  increases  to  4.2  by  the  incorporation  of  a  mirror. 

S-SEED  arrays,  ranging  in  size  from  16x16  (20  pmx20  pm  windows)  to  48x96  (9216  diodes  with 
7  pm  windows),  have  been  fabricated.  The  devices  incorporate  a  metal  minor  on  foe  top  for  reflection-mode 
operation,  with  the  light  passing  through  the  GaAs  substrate  and  the  temperature  stabilised  sapphire  mount 
Anti-reflection  coatings  have  been  applied  at  all  interfaces.  The  whole  package  is  compatible  with  our 
optomechanical  system  used  for  optical  processing  deminstntors  [2],  Individual  S-SEEDs  show  bistability 
with  3.04.2  contrast  ration  at  10V  bias  for  wavelengths  of  1059-1064  nm.  Similar  arrays  with  one  bond-pad 
for  each  S-SEED  have  also  been  fabricated  for  flip-chip  bonding  to  silicon  CMOS  to  make  a  spatial  light 
modulator. 
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Abstract 

Wafer  level  optical  and  electronic  testing  of  smart  pixel  arrays  is  a  must  if  a  high  yield  technology  is  to  be 
developed.  Our  results  on  GaAs  FET-SEED  based  switching  nodes  show  the  uniformity  and  performance  control 
levels  achievable  with  today's  technology  ,  as  well  as  its  limits. 

The  fabrication  of  monolithically  integrated  semiconductor  microelectronics  components  with  optical  devices  must 
be  a  high  yield  process  in  order  to  accomplish  large  scale  manufacturing  of  smart  pixel  array  s  for  optical 
interconnects  and  switching.  Smart  pixels  consist  of  optical  input  and  output  devices,  whom  performance  is 
enhanced  with  logic  and  amplifying  circuitry.  We  present  here  4x4  arrays  of  2x1  photonic  switching  nodes  [1] 
made  using  the  field-effect-transjstor  self  electro-optic  effect  device  (FET-SEED)  technology  [2,3].  Although 
simple  circuits  can  switch  as  fast  200ps  [4].  and  fully  functional  individual  nodes  operating  at  400Mb/s  have  been 
demonstrated  [1,5],  the  speed  of  an  entire  array  was  only  155Mh/s,  limited  by  the  non-uniformity  of  the  individual 
nodes.  We  have  undertaken  the  task  of  systematically  mapping  the  performance  of  the  arrays  at  the  wafer  level,  in 
order  to  identify  failures  and  their  origin. 

A  node  is  schematically  shown  in  Fig.1  It  consists  of  an  optical  receiver,  an  inverter,  a  control  memory  and  a 
multipfexer/driver/transmitter.  The  electrical  output  A  of  the  receiver  becomes  one  of  the  inputs  to  the  2x1 
multiplexer/driver  located  within  the  same  node.  The  other  input,  B,  conies  from  a  receiver  located  in  another 
node.  Each  mux/driver  also  has  a  pair  of  complementary  electrical  inputs  (Q  and  Q  in  Fig.1),  whose  role  is  to 
control  which  one  of  the  inputs  A  or  B  is  regenerated  as  the  optica)  output  C.  The  control  memory  (set-reset  latch) 
stores  this  control  bit  An  electrical  control  signal,  vcc  ,  common  to  all  the  nodes  in  the  array,  is  held  either  high, 
to  enable  writing  of  the  memories  with  the  control  bits  preceding  the  data  bits,  or  low,  to  preserve  the  memory 
during  the  time  the  data  are  processed. 


Fig.1.  Schematic  diagram  of  a  single  node.  Notice  the  four  "indicator  lights”  placed  at  the  bottom  left  (the  A  and 
A),  at  the  center  (Q),  and  the  hip  right  (Q). 
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The  circuits  contain  quantum  well  p-i-n  detector  diodes,  Schottky  damping  diodes,  single-gate  and  double-gate 
field-effect  transistors.  Single  diodes,  transistors,  resistors  and  conductors  are  fabricated,  adjacent  to  each  array  of 
smart  putds,  for  individual  component  characterization.  By  measuring  their  characteristics  and  mapping  them 
over  the  area  of  the  wafer  we  obtain  the  yields  of  individual  devices,  as  well  as  the  degree  of  uniformin'.  These 
yields  arc  in  general  vety  high,  of  the  order  of  90%  and  better.  An  example  of  *t*ta  mapping  on  a  3”  wafer  is 
shown  in  Fig.  2,  for  threshold  voltages  of  FETs  with  10pm  long  gates.  The  figure  shows  good  uniformity.  A 
variation  of  lOOmV  is  observed  within  a  2”  diameter,  i.e.  50%  of  the  useful  area;  the  variation  of  Vth  over  the 
entire  area  is  about  300mV.  The  pattern  in  Fig.2  is  circular,  which  indicates  that  the  variation  is  likely  due  to 
thickness  and/or  doping  variation  in  the  MBE  growth,  rather  than  to  the  subsequent  processing. 


Fig.2  Mapping  of  threshold  voltages  on  a  3"  wafer. 
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Fig.  3.  Optical  functional  testing  of  the  entire  4x4 
anay.  The  first  cell  illustrates  the  four  "indicators* 
which  are  monitored:  the  signal  A  from  the  receiver, 
the  signal  X  from  the  inverter,  the  two  control 
signals  from  the  memory,  Q  and  Q. 


Testing  the  functionality  of  these  nodes  is  a  challenging  task.  Techniques  typical  to  electronic  testing  of  integrated 
circuits  are  usefbl,  but  insufficient:  our  circuits  must  perform  optical  functions  as  well.  Thus  the  testing  must 
address  both  optical  and  electronic  functions.  The  implementation  of  such  a  technique  for  testing  the  receiver, 
inverter  and  the  control  memory  is  illustrated  by  the  use  of  ’indicators"  in  Figs.  1  and  3.  Each  of  the  four  "indicator 
circuits"  in  Fig.  1  consists  of  a  quantum  well  diode  in  series  with  a  FET.  When  the  FET  is  conducting,  the  diode  is 
forward  biased  and  emits  light.  The  state  of  die  FET  is  determined  by  the  voltage  on  its  gate.  In_Fig.l,  the  gate 
voltage  on  the  indicators  is  determined  by  the  X  and  A  signals  for  the  two  lower  left  ones,  and  by  Q  and  Q  for  the 
center  and  top  left  indicator.  The  A  signals  are  simulated  electrically.  The  Q  signals  are  controlled  bv  the  voltage 
Vc*  A  periodic  sequence  ofhigh  and  low  A  signals  is  coincident  in  time  with  a  periodic  sequence  of  high  and  low 
Vgg.  Since  only  the  "high  V  allows  changing  the  stale  of  the  memoty,  die  rate  at  which  Q  and  Q  switch  is  half 
the  rate  at  which  the  A  signals  switch.  An  infrared  camera  and  a  VCR  me  used  to  record  the  "indicator  lights' 
switching.  The  "morion  picture",  a  frame  of  which  is  illustrated  in  Fig.  3,  contains  the  images  of  all  the  switching 
indicsion  for  every  array  on  the  wafer.  Similar  optical  measurements  on  the  different  types  of  nodes,  as  well  as  on 
especially  designed  "yield  tester  arrays"  (consisting  of  indicator  circuits  only),  have  allowed  us  to  assess  the  present 
ftmctional  yields.  They  vary  from  5%  to  50%,  depending  on  the  circuit  Our  preliminaiy  analysis  of  failures 
indicates  that  the  vast  majority  of  them  are  doe  to  processing  errors  which  can  be  avoided  by  using  improved 
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Abstract 

We  describe  a  noaoMtUcafly  Integrated  optoelectronic  smart  pixel  fabrication  process  using  a 
GaAs/AIGaAs  system.  The  smart  pixels  are  realized  with  depletion  type  metal  semkondnctor  field -effect 
transistors  (MESFET*),  high  brightness  Rght-amttttng  diodes  (HBLEDs)  and  photodiodes  (PDs). 


Moootithically  integrated  optoelectronic  "smart 
pixels"  are  of  current  interest  in  the  field  of 
parallel  optical  interconnects,  early  vision 
processing  and  optoelectronic  neural  networks 
[U]. 

We  have  developed  a  fabrication  process 
where,  in  a  flexible  way  different  kinds  of 
optoelectronic  smart  pixels  can  be  realized  in  die 
GaAs/AlGaAs  material  system.  The  material  is 
grown  in  a  single  step  by  metal  organic  chemical 
vapour  deposition  (MOCVD).  On  a  n+ -doped 
GaAs  substrate,  we  first  grow  a  HBLED  with  a  n- 
type  Alo.3Gao.7As/ Alo.gGao.2 As  distributed  Bragg 
reflector  of  30  pairs  [3],  The  PD/MESFET  layers 
are  grown  on  top  of  this  structure.  They  consist  of 
a  1  pm  thick  GaAs  p'-buffer/absorber  layer,  a  200 
nm  thick  1*1017  cm  3  n -doped  GaAs  channel,  a  20 
nm  thick  etch-stop  layer  of  Alo.3Gao.7As  and  a 
GaAs  n+-contact  layer.  The  p'n-junction  between 
the  buffer/absorber  layer  and  the  channel  of  the 
MESFET  is  used  as  a  PD  (Fig.  1).  The  bondgap 
difference  between  the  PD  absorber  and  the 
HBLED  emission  is  127  meV.  The  light  emitted 
by  the  HBLED  is  shifted  to  790  nm  using  a 
Alo.03Gao.95As  QW.  Therefore  die  fight  cm 
efficiently  be  detected  by  die  PD.  The  etch-stop 
layer  ensures  a  homogenous  MESFET  threshold 
voltage  over  a  large  area.  The  entire  n-p-n  layer 
structure  is  described  in  [4], 

HBLED  MESFET  PO 


Fig.  1:  Schematic  layer  requeue*  of  an  optoelectronic 
imart  pixel. 


The  fabrication  process  requires  eight 
photolithographic  steps  and  is  based  on  mesa 
isolation.  The  backside  ohmic  contact  is  formed 
first  by  a  Ge-AuNiAu  metallisation.  This  is  the  n+- 
contact  to  all  HBLEDs.  For  the  ohmic  contacts  of 
the  MESFETs  and  die  PDs  a  NiGe-AuNiAuPt 
metallisation  is  evaporated  and  annealed  at  440C°. 
Then  the  individual  MESFETs  and  PDs  are 
separated  by  magnetron  enhanced  reactive  ion 
etching  (MIE)  of  mesas.  After  the  deposition  of  a 
Si3N4  dielectric  film,  contact  areas  to  the  ohmic 
contacts  of  the  MESFETs  and  PDs  on  the  n+-side 
as  well  to  die  ohmic  contact  of  the  HBLEDs  and 
the  PDs  on  die  p+-side  are  opened  by  reactive  ion 
etching  (RIE).  The  gate  areas  are  then  also  opened 
by  ROE.  The  dielectric  film  is  used  as  a  mask  for  a 
selective,  recessed  wet-etch  of  the  gate  area.  This 
etch  removes  the  n+ -contact  layer  and  produces  an 
undercut  of  200  nm  under  the  dielectric  film.  The 
next  metallisation  (TiPtAu)  forms  the  Schottky 
contact  on  the  gate  areas.  This  metallisation  is  also 
used  as  the  p+-contact  of  the  HBLEDs  and  the 
PDs,  and  as  a  first  wiring  level.  The  HBLEDs  are 
isolated  by  a  second  level  of  MIE  mesa  etching 
The  secood  wiring  (TiAl)  level  is  patterned  by 
wet-etching  on  a  dielectric  film . 

The  fabricated  devices  are  then  characterised. 
The  MESFETs,  which  have  a  threshold  voltage  of 
-1.75V,  show  a  transconductance  of  55  mS/mm 
(Ugs*0V,  Uds=4V)  for  a  gate  length  of  1.5  pm 
and  an  effective  gate  width  of  27  pm  (Fig.  2).  The 
current  density  per  gate  length  is  66  mA/mm 
(Ugg-0V,  Ufe^V).  The  responsivity  of  the  PDs  is 
0.56  A/W  at  a  wavelength  of  790  nm.  This 
implies,  for  a  practical  optical  input  power  of  (1-5) 
pW,  a  photo  current  of  (0.56-2.8)  pA.  An  emission 
efficiency  of  0.0095  W/A  was  measured  for  the 
HBLED.  This  implies,  for  HBLEDs  of  25*25  pm^ 
active  area,  a  light  output  power  of  95  pW  at  an 
input  current  of  10  mA,  and  a  power  dissipation  of 
22  mW.  The  mean  emission  wavelength  is  792 
nm. 


We  have  simulated  a  simple  threshold  cucuit 
in  which  all  the  parasitic  devices  are  implemented 
(Fig.  3).  The  HBLED  cm  be  traned  off  by 
increasing  the  light  input  power  Pm  The  threshold 
can  be  controlled  by  clanging  the  gate  voltage 
VlH  of  da  MESFET  Ml.  The  output  current  is 
haliaaod.  i.e.  the  current  flows  either  through  the 
HBLED  or  through  the  MESFET  M2.  The 
MESFET  M3  serves  as  a  current  source  and  limits 
the  HBLED  coreat.  The  simulated  data  predict  an 
average  optoeietrooic  gain  of  260  for  a  contrast 
ratio  of  10.  The  threshold  circuit  is  realised  on  a 
chip  area  of  approximately  250250  pm2  implying 
a  maximum  pixel  density  of 450  per  cm2 . 


Fig.  2a)  The  drain  I-V  characteristic  of  a  MESFET  with 
a  gate  length  of  15  pm  and  an  effective  width  of  27  p 
m,  b)  the  quantum  efficiency  and  respousivity  vs. 
wavelength  for  a  PD  with  an  active  area  of  100*100 
pnt2,  c)  the  optical  output  power  and  diode  voltage  vs. 
injection  current  of  a  HBLED  with  an  25*25  pm^  active 


We  have  used  HPSPICE  to  simulate  an 
optoelectronic  smart  pixel.  The  GaAs  MESFET 
level  1  model  of  HPSPICE  is  serving  for  the 
simulation  of  die  MESFET.  The  po-junetkm 
model  of  HPSPICE  is  extended  to  include  a 
parallel  resistance  to  simulate  the  reverse  ament 
of  the  PD.  The  same  model  is  used  for  the 
simulation  of  the  HBLED.  The  MESFET  model  is 
also  extended  to  include  a  PD,  which  lies  under 
the  MESFET.  The  bnckgating  effect  on  the 
MESFET  is  swwriatod  by  a  voltage-controlled 
voltage  source  placed  between  the  internal  gate 
node  and  the  gate  terminal.  The  PD  is  connected 
by  the  p+-layer  to  dm  parasitic  HBLED.  To  obtain 
accurate  fits  to  the  measurements,  all  these 
parasitic  devices  mutt  be  implemented  in  the 


Fig.  3a)  The  circuit  diagram  of  the  simulated  threshold 
circuit,  b)  the  simulated  optical  output  power  vs.  optical 
input  power. 

Based  on  these  results,  we  are  designing  and 
fabricating  different  types  of  optoelectronic  smart 
pixels,  in  particular  optoelectronic  neurons  for 
optoelectronic  neural  networks. 
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Abstract 

The  performance  gains  associated  with  optical  computing  schemes  must  be  traded  off 
with  the  complexity  of  individual  computing  nodes  in  order  to  optimise  the  efficiency  of 
an  optical  processor.  We  categorise  the  necessary  trade-offs  and  provide  examples  of  such 
optimisation. 

Introduction 

The  recent  advent  of  VLSI  compatible  optically  interconnected  electronic  devices  has  triggered 
a  tremendous  interest  in  the  use  of  such  components  at  a  system  level.  Few  studies,  however, 
have  focussed  on  the  numerous  trade-offs,  benefits  and  limitations  which  are  offered  by  the  com¬ 
bination  of  the  competing  technologies  (electronics  and  optics)  [l]-[3].  To  recognize  whether  a 
task  is  worth  being  performed  in  an  opto-electronic  form  constitutes  probably  the  first  stage  of 
the  investigation.  A  definite  answer  to  such  a  question  assumes  however  that  all  trade-offs  have 
already  been  fully  quantified.  Some  tasks  are  nevertheless  recognized  as  a  priori  more  efficient 
if  optically  implemented  by  algorithms  whose  possible  mappings  take  benefit  of  the  advantages 
of  non-local  optical  interconnections.  For  example,  the  two-dimensional  sorting  can  be  achieved 
with  the  bitonic  sort  algorithm  which  uses  the  folded  perfect-shuffle  [4,  5].  It  is  the  purpose 
of  this  paper  to  carry  out  a  complexity  analysis  of  the  opto-electronic  implementation  of  such 
functions  on  optical,  algorithmic  and  electronic  grounds. 

Algorithmic  considerations 

Previous  studies  showed  that  a  bonus  factor  of  at  least  two  orders  of  magnitude  is  expected 
for  the  8-bit  sorting  if  the  computational  time  of  the  sorting  is  reduced  [6].  In  other  wends,  the 
degree  of  smartness  of  the  pixel  impacts  on  the  number  of  processing  steps  required  for  sort¬ 
ing.  Performance  metrics  of  some  algorithms,  which  all  perform  the  sorting,  will  be  presented. 
They  differ  by  the  minimum  required  intelligence  or  smartness  at  the  nodes  of  the  multi-stage 
interconnection  network  (MIN).  For  example,  the  powerful  bitonic  sort  algorithm  demands  the 
presence  of  complex  exchange-bypass  modules  [5].  In  that  respect,  it  is  expected  that  such  an 
algorithm  provides  a  large  bonus  factor  compared  to  its  electronic  counterpart.  The  algorithms 
are  implemented  within  the  Extended-Cellular-Logic-Image-Processor  (EX-CLIP)  architecture 
[6]. 

Optical  considerations 

The  algorithmic  analysis  ignores  the  feasibility  of  large  scale  implementation  in  the  optical 
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and  electronic  domain.  In  the  optical  domain,  the  limitations  on  the  upscaling  possibilities  orig¬ 
inate  from  fundamental  properties  or  practical  constraints.  For  example,  the  aberration  and 
diffraction  of  the  imaging  devices  and  the  space-variant  nature  of  the  optical  interconnections 
impose  an  upper  limit  on  the  space-bandwidth-product  available.  Technical  limitations  encom¬ 
pass  the  micro-mechanical  alignment  and  the  stability  of  the  optical  implementation. 

The  highly  partitioned  feature  of  an  opto-electronic  array  arises  from  placing  the  electronic 
circuitry  of  each  pixel  around  their  regularly  distributed  transceivers.  This  conventional  layout 
eases  also  the  design  of  the  synthetic  diffractive  elements  (DO Els)  traditionally  used  as  fan-out 
and  interconnection  optical  units  [7].  A  pixel  density  can  be  then  defined  which  is  directly  related 
to  the  degree  of  smartness  of  the  pixel.  A  multichip  configuration  [8]  increases  the  throughput 
rate  at  the  expense  of  a  larger  field  of  view  of  the  imaging  devices.  The  resulting  increased  space 
bandwidth  product  is  moreover  rarely  fully  utilized  because  of  the  low  density  of  transceivers. 
In  other  words,  the  pixel  density  is  limited  by  the  scaling  possibilities  of  the  optical  hardware. 
Solutions  have  been  proposed  to  overcome  that  limitation  such  as  the  use  of  hybrid  optical  compo¬ 
nents  [9]  or  the  topological  separation  of  the  transceivers  from  their  circuitry  [10].  The  possibility 
to  scale  up  the  architecture  will  be  analysed  for  the  different  available  arrays. 

Electronic  considerations 

The  type  of  technology  and  the  logic  family  used  for  the  electronic  circuitry  impacts  on  the 
throughput  rate  through  different  forms.  The  area  taken  by  the  logic  circuitry,  while  depending 
on  the  ability  of  the  electronic  designer,  provides  higher  bounds  on  the  pixel  density.  These 
bounds  can  be  further  modified  if  the  power-delay  product  of  the  logic  gates  and/or  the  energy 
dissipated  by  the  transceivers  prevent  efficient  off-chip  heat  dissipation  .  Comparisons  based  on 
area  and  power  consumption  will  be  made  in  the  cases  of  GaAs  S-SEEDs,  L-SEEDs,  FET-SEEDs 
arrays  [11]  as  well  as  hybrid  arrays  which  involve  SEEDs  transceivers  flip-chip  bonded  on  silicon 
based  CMOS  circuits.  These  arrays  exhibit  the  functionalities  required  for  the  different  algo¬ 
rithms  explained  above. 

Conclusion 

Tabulating  various  device/architecture/algorithm  combinations  under  the  considerations  men¬ 
tioned  above  allows  the  optimisation  of  such  combinations  in  terms  of  the  pixel  complexity.  Ex¬ 
amples  of  the  optimisation  of  sorting  and  other  transforms  will  be  presented  in  this  context  at 
the  conference. 
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Abstract 

A  highly  efficient  optical  coupling  system  using  microlens  arrays  for  free-space  optical  switching 
networks  was  proposed  and  demonstrated.  High  speed(<100  ps)  and  high  extinction  ratio  for  1.3  pm 
wavelength  were  obtained  by  a  newly  proposed  ferroelectric  liquid-crystal  driving  method. 

1.  Introduction 

Two  dimensional  optical  switching  devices  consisting  of  liquid-crystal  cell  arrays  and  birefringent 
plates  have  many  advantages  for  use  in  large-scale  and  transparent  optical  switching  networks.  We 
have  demonstrated  the  successful  operation  of  an  8-stage  optical  concentrator  using  1024-input-ports 
optical  beam  shifter  modules  for  visible  light.(1),(2)  To  use  this  type  of  module  for  optical  communica¬ 
tion  networks,  good  performance  for  long-wavelength,  highly  efficient  fiber-to-fiber  coupling  and 
high  speed  switching  operation  are  required.  This  paper  describes  the  new  optical  design  for  low-loss 
interconnection  and  a  new  method  of  driving  high  speed  ferroelectric  liquid-crystals(FLC). 

2.  Optical  design  for  fiber  interconnection 

Figure  1  shows  an  example  of  NxN  optical  switching  network/31  The  distance  between  input  and 
output  fiber  arrays  is  several  tens  of  centimeters  because  the  system  consists  of  the  cascaded  optical 
beam  shifter  modules.  The  beam  expansion  due  to  diffraction  results  in  very  large  increases  of  the 
coupling  loss  and  limits  the  size  of  switching  networks. 

Figure  2  shows  the  newly  proposed  configuration  of  lenses  for  low-loss  propagation.  Figure  2(a) 
shows  the  confocal  lens  system,  which  consists  of  a  very  short  focal  length  lens  and  a  long  focal 
length  lens,  to  obtain  long  collimation  distances.  Figure  2(b)  shows  the  relay  lens  with  long  focal 
length  for  compensating  beam  expansion.  The  number  of  relay  lenses  depend  on  the  size  of  switching 
system.  Relay  lenses  can  compensate  the  beam  expansion.  Figure  3  shows  the  changes  of  beam  radius 
along  the  propagation  direction  in  Fig. 2(a)  using  a  ball  lens(f=0.57  mm)  and  sputter-liftoff 
microlens(f=25  mm)  in  Fig.4.  Microlenses  was  fabricated  by  sputter  deposition  using  a  shadowing 
effect. 

Figure  5  shows  the  experimental  configuration  of  a  polarization  independent  switching  system.  On 
the  input  side,  an  input  beam  is  separated  into  two  components,  an  ordinary-ray  and  an  extraordinary- 
ray.  These  beams  go  through  the  two  adjacent  cells  in  each  optical  beam  shifter.  At  the  final  stage,  the 
polarization  of  each  beam  is  controlled  and  the  beams  are  combined  into  one  non-polarized  beam. 
Optical  beam  shifter  modules  with  990  pm  cell  size  and  350  pm  aperture  radius  are  used.  Two  long 
food  length  lenses,  MLl(f=150  mm)  and  ML2(f=25  mm),  are  very  effective  for  highly  efficient  opti¬ 
cal  coupling  between  an  input  fiber  and  an  output  fiber  array/41  Very  low  total  insertion-loss  of  8.7  dB 
was  obtained  for  four  output  ports.  This  value  proves  the  accurate  coupling  to  output  fibers  in  spite  of 
very  long  propagation  span  of  424  mm. 

3.  High  speed  optical  switch  for  U  pm  using  FLC 

Ferroelectric  liquid-crystal  is  a  very  attractive  material  for  high  speed  operation,  which  is  two  to 
three  orders  higher  than  that  of  twisted-nematic  liquid-crystal.  But,  there  are  two  main  problems  to 
solve  in  adapting  it  to  optical  switching  devices  for  long  wavelengths.  To  obtain  the  high  extinction 
ratio,  the  cell  gap  is  required  to  be  two  times  thicker  than  that  for  visible  light.  This  thicker  gap  re¬ 
duces  the  FLC  memory  effect  The  ordinary  bipolar  driving  method  is  not  adequate  for  optical  switch¬ 
ing  because  of  short  break  of  optical  signal  by  reset  pulse. 

To  overcome  these  problems,  we  developed  a  new  driving  method  for  FLC  optical  switch.  An 
active-matrix  driving  method  with  the  voltage  waveforms  shown  in  Fig.  6  was  used.  The  unipolar  data 
driving  pulse  can  prevent  a  short  signal-break.  The  low  voltage  part  of  the  pulse  can  sustain  the  cell 
data  without  losing  reliability  and  the  high  voltage  part  of  one  pulse  results  in  high  speed  switching 
and  high  extinction  ratio.  In  Fig.7,  high  extinction  ratio  (>30  dB)  and  low  insertion-loss(<l  dB)  were 
obtained  at  1.3  pm  wavelength.  High  speed  switching  operation(<100  ps)  was  confirmed.  This  value 
is  two  orders  faster  than  that  of  twisted-nematic  liquid-crystal. 
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4.  CoflduioH 

We  proposed  a  new  configuration  microlens  system  for  high  optical  coupling  and  experimentally 
demonstrated  low  propagation  loss.  An  FLC  driving  method  for  optical  switch  in  optical  communica¬ 
tion  networks  was  newly  proposed.  High  speed  operation  and  high  extinction  ratio  were  obtained 
using  this  new  method.  These  optical  switching  modules  are  applicable  to  large  scale  free-space  opti¬ 
cal  switching  netwodcs. 


(1) S.Sliirai  etiL  J’S’9Z2A6>tinik.l992.  (3)M.Y*m*guchi  ec*L;NTT  REVTEWJ,p62.1993. 

(2) M.Yamaguchi  euU.;PS'92,lA5.Miiuk,1992.  (4)K.Koyibu  etaL;OFC/IOOC*93  Tech.  Dig  .TuB2.1993. 


Fig.3  Propagation  profiles  for  collimation  lens  system. 
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(b)  Relay  lens 

Fig.2  Microlens  configuration  for 
long-span  beam  propagation. 


Rg.4  Photograph  of  sputter-liftoff 
microlens  array. 
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(b)  Wane  tome 

Fig. 6  New  driving  method  for  ferroelectric  liquid-crystals. 


Fig.7  Dependence  of  extinction 
ratio  on  wavelength. 
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Abstract 

Micro-optical  beam  deflectors  with  a  high  transmittance  of  95%,  a  large  deflection 
angle  of  15*  ,  and  with  a  structure  of  liquid  crystals  sandwiched  by  microprism  plates  are 
used  in  free-space  optical  interconnections  and  optical  switches. 

Introduction 

Various  kinds  of  optical  beam  deflecting  devices  such  as  holographic  diffraction 
cells1*3  have  been  developed  for  dynamic  free  space  optical  interconnections.  However  they 
have  the  drawbacks  of  low  efficiency,  low  beam  steering  angle,  undesired  high  order 
diffraction  peaks  and  the  limited  wavelength  range.  The  liquid  crystal  (LC)  microprism  array, 
which  is  an  LC  cell  sandwiched  between  substrates  that  have  notches,  was  originally 
developed  by  Sato  et  al^.  We  have  improved  these  for  microbeam  optical  interconnections,  and 
achieved  high  transmittance  of  95%  and  high  deflection  angle  of  15  degrees.  We  have  also 
developed  optical  interconnections  and  free-space  optical  switches  that  use  these 
interconnection. . 

Device  structure  and  principle _ flf._Jt.eaia _ deflection 

The  structure  of  the  LC  microprism  array  is  shown  in  Fig.  1.  It  is  a  homogeneously 

aligned  nematic  LC  cell.  Micro  prisms  with  a  pitch  of  250  pm  are  fabricated  on  one 
transparent  plate,  which  is  then  covered  with  a  transparent  indium  tin  oxide  (ITO)  electrode. 
The  input  optical  beam  is  refracted  at  the  prism  plane  according  to  Snell's  law.  When  voltage  is 
applied,  the  LC  molecules  are  realigned  and  the  refractive  index  varies  from  ne  to  no.  Thus  the 
deflection  angle  changes  with  applied  voltage.  Larger  deflection  angle  can  be  obtained  by  using 
an  LC  with  larger  refractive  anisotropy.  The  dependence  of  beam  deflection  angle  on  applied 

voltage  is  shown  in  Fig.  2.  For  the  apex  angle  <|>  of  40’  the  deflection  angle  was  15  ’  The 
transmittance  was  95%,  independent  of  the  applied  voltage. 

Rficonfigurable _ optical _ interconnections 

We  used  the  LC  microprism  array  beam  deflectors  in  optical  interconnections  between 
2-D  optical  switches  as  shown  in  Fig.  3(a).  We  used  2-D  fiber  arrays  (8x8)  with  a  pitch  of  250 

pm  for  the  input  and  a  CCD  camera  for  the  output.  The  results  for  crossing  optical  beams  are 
shown  Fig.  3(b). 

free-spact _ oalifai _ switch 

Two  crossed  LC  microprism  arrays,  between  which  a  A/2  plate  is  inserted,  can  also  be 
used  as  a  free-space  optical  switch  as  shown  in  Fig.  4(a).  1x9  switching  is  demonstrated  in 
Fig.  4(b).  The  optical  beam  can  be  deflected  to  any  point  within  an  area  of  5  mm  x  3.8  mm  on 
a  plane  30  mm  from  the  LC  microprism  deflectors. 

Summary 

Micro  optical  beam  deflectors  (LC  microprism  arrays)  were  developed  for 
reconfigurable  free-space  optical  interconnection  and  electrical  alignment  of  optical  beams. 
These  devices  can  deflect  closely  spaced  optical  beams  individually  to  any  position  with  high 
transmittance  of  95%,  high  deflection  angle  of  15*  and  low  voltage  of  3V.  Reconfigurable 
optical  interconnections  and  various  optical  networks  can  be  achieved  simply  by  changing  the 
voltage  applied  to  each  miroprism  array.  Furthermore  the  LC  microprisms  were  also  shown  to 
be  useful  for  large-scale  free-space  optical  switches. 
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Fig.  3  Optical  interconnection  by  the  LC 
microprism  array 


Fig.  4  Optical  beam  switching  by  the 
LC  microprism 
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This  paper  describes  our  recent  liquid  crystal  on  silicon  spatial  light  modulators.  We  will 
present  the  most  recent  results  from  our  256  by  256  binary  reflection  mode  SLM  and  our 
128  by  128  analog  SLM. 


Introduction 

Liquid  crystal  on  silicon  spatial  light  modulators  are  made  by  placing  a  thin  layer  of  liquid 
crystal  directly  on  top  of  a  silicon  chip.  For  a  recent  review  of  this  technology  see,  for 
example,  reference  [1].  We  have  constructed  a  256  by  256  binary  SLM  for  application  in 
an  optical  correlator  which  uses  a  ferroelectric  liquid  crystal  as  the  light  modulating  layer. 
We  are  also  in  the  process  of  constructing  an  analog  SLM. 


The  256  by  256  binary  SLM 

The  SLM  is  based  on  a  7mm  by  7mm  CMOS  die  fabricated  by  US2  under  a  1.2  micron 
design  rule.  The  active  array  area  is  a  square  approximately  5.53mm  on  a  side.  Each 
pixel  in  the  array  is  addressed  by  a  row  select  wire  and  a  column  data  wire.  The  row 
wire  activates  the  pixel  transistor  gate  and  the  column  wire  presents  a  binary  data  signal 
to  the  pixel.  On  activation  of  the  gate  wire  the  data  on  the  column  wire  is  written  to 
the  pixel,  where  it  is  capacitively  stored.  The  resulting  electric  field  between  the  pixel 
mirror  and  a  transparent  electrode  on  a  piece  of  cover  glass  drives  the  liquid  crystal  into 
the  desired  state. 

Data  are  transferred  to  the  SLM  over  32  parallel  lines  under  the  control  of  a  master 
clock  and  a  frame  sync  signal.  We  have  demonstrated  the  addressing  of  the  SLM  with  a 
master  clock  frequency  of  48MHz  which  gives  an  image  refresh  rate  of  23.5kHz.  This  is 
data  rate  of  1.6Gb/s  from  the  driver  board  to  the  SLM.  To  achieve  these  data  rates  the 
SLM  backplane  utilised  on-chip  clock  and  control  signal  generation  and  data  pipelining. 

The  most  important  design  criteria  for  the  correlator  application  were  the  optical 
efficiency  and  the  frame  rate.  In  a  correlator  which  uses  identical  SLMs  in  the  input  and 
Fourier  planes,  the  amount  of  power  in  the  output  is  proportional  to  the  fourth  power  of 
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the  flat  fill-factor.  The  use  of  minimum  geometry  polysilicon  wires  for  the  pixel  gates  helps 
maintain  the  fill-factor,  but  requires  us  to  take  account  of  the  relatively  slow  propagation 
of  gate  signals  across  the  array.  The  control  circuits  within  the  chip  are  designed  to  start 
activating  the  gate  line  for  a  row  simultaneously  with  the  data  input  for  that  row.  This 
data  is  then  maintained  on  the  column  data  wires  while  simultaneously  the  gate  line  is 
taken  low,  the  new  data  is  loaded  into  the  shift  register  and  the  next  gate  line  is  driven 
high.  This  scheme  allows  us  to  present  the  frame  of  data  as  a  continuous  stream  without 
interruptions  to  wait  for  the  gate  liner  to  switch. 

Results 

The  fabricated  chips  have  been  packaged  and  bonded.  Electrical  testing  shows  that  the 
shift  registers  and  clock  generation  circuits  function  as  designed.  A  cover-glass  was  fixed 
over  the  pixel  array,  spaced  with  polyimide  balls,  and  the  gap  filled  with  BDH  SCE13 
liquid  crystal  by  capilliary  action  under  vacuum.  The  liquid  crystal  was  aligned  bv  means 
of  a  rubbed  PVA  layer  on  the  cover  glass.  Measurements  of  the  optical  properties  of  the 
SLM  show  that  the  optical  switching  speed  of  the  liquid  crystal  is  50ps  (10%  to  90%  and 
90%  to  10%)  and  the  zero-order  contrast  ratio  is  70:1.  The  imaged  contrast  ratio  was 
measured  to  be  10:1.  The  device  specifications  are  summarized  in  the  table  below. 

Two  of  these  SLMs  have  been  used  in  an  optical  correlator  as  input  and  Fourier  plane 
devices  operating  in  a  binary  phase  mode.  The  system  was  operated  at  1000  correlations 
per  second,  i.e.  a  positive  input  and  filter  pattern  were  written  for  500/xs,  followed  by 
their  inverses  for  500/is.  The  camera  shutter  was  set  at  1ms  to  capture  the  output  from 
both  true  and  inverse  frames.  On  and  off-axis  inputs  consisting  of  small  targets  that  used 
0.6were  used.  With  the  laser  producing  approximately  5m W  the  correlation  peaks  in  the 
output  were  able  to  saturate  the  camera  and  yield  a  signal  to  noise  (pk/rms)  of  greater 
than  lOdB  for  most  of  the  images. 


Array  size 

256  by  256 

Pixel  Pitch 

21.6pm 

Fill-factor 

79%  1 

Flat  fill-factor 

(SHHHHHI 

Diffraction  efficiency 

8 IBB 

Throughput  into  zero  order 

3.4% 

Demonstrated  frame  load  time 

43ps 

Simulated  frame  load  time 

27ps 

Contrast  ratio 

70:1  (10:1  imaged) 

LC  switching  time 

50ps 

The  128  by  128  analog  SLM 

We  are  in  the  process  of  constructing  am  analog  SLM  with  a  40pm  pixel  pitch.  Results 
from  this  device  will  be  presented  at  the  meeting. 
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Abstract 

We  (flecuss  recent  advances  in  the  design  and  fabrication  of  electronically  addressed 
for  roolac  trie  tkmkf  crvatel  over  alicon  aoatlal  Itaht  modulators.  We  summarise  the  nminrwtii 
for  further  advances  In  the  near  future. 

BACKGROUND 

The  Spatial  Light  Modulator  (SLM)  is  a  key  component  in  many  optical  computing  systems.  The  SLM 
technology  of  Ferroelectric  Liquid  Crystal  over  Very  Large  Scale  Integrated  (FLC/VLSJ)  silicon  has 
matured  considerably  over  tie  past  few  years.  Eiectronicaly  Addressed  SLMs  (EASUMTs)  of  medium 
resolution  (12 8*  pixels  or  more)  have  been  reported  by  several  labs  [1];  systems  containing  multiple 
devices  have  been  reported  [2].  We  are  now  designing  custom  devices  with  system-specific 
performance  characteristics. 

INTRODUCTION 

FLC/VLSi  SLMs  are  produced  by  sandwiching  a  thin  layer  of  FLC  between  a  custom  designed  siioon 
backplane  and  a  cover  glass  coated  on  the  inside  with  a  transparent  conductive  electrode.  Most 
EASLM  backplane  designs  are  based  on  a  pixel  circuit  consisting  of  one  active  element  •  a  MetaJ- 
Oride-Semiconducior  Field  Effect  Transistor  (MOSFET)  which  acts  as  a  switch  to  control  the  amount 
of  charge  stored  on  a  capacitive  storage  element  -  a  small  metal  mirror  on  the  surface  of  the  siicon. 
The  voltage  tints  generated  produces  an  electric  field  which  alters  the  state  of  a  thin  overlying  layer  of 
FLC  to  produce  a  binary  phase  or  amplitude  modulation  in  an  incident  wavefront. This  scheme 
provides  the  smalleet  possible  pixel  and  thus  the  highest  density  of  pixels;  it  is  analogous  to  the 
purely  electronic  Dynamic  Random  Access  Memory  (DRAM)  ceK.  This  type  of  pixel  suffers  from  light 
induced  charge  lerirage  which  is  manifest  as  a  reduction  in  contrast  ratio  with  incident  fight  intensity 
thus  limitting  the  maximium  light  level  at  which  it  cm  be  operated.  There  is  also  a  limit  to  the 
spontaneous  polarization.  Ps,  of  the  FLC  material  which  can  be  used  -  the  pixel  capacitor  must  store 
enough  charge  to  switch  it  The  drive  to  reduce  the  pixel  size  is  compromised  by  the  need  to  maintain 
an  opticaly-flat  metal  area  to  act  as  the  reflective  aperture  (or  mirror).  Underlying  circuit  elements 
such  as  transitors  or  interconnect  cause  undulations  in  the  overtying  part  of  the  mirror;  these  cm 
cause  non  uniform  optical  contrast  across  a  mirror,  losses  due  to  scattering  and,  in  coherent  systems, 
phase  variations.  A  flat  fil  factor  (flat  mirror  area  /  pixel  area)  of  around  25%  has  become  a  de  facto 
rranimum. 

CURRENT  DESIGNS 

We  have  previously  reported  a  176  x  176  D  RAM-type  pixel  array  [3]  .The  frame  rate  was  Bmited  by 
the  RC  time  constant  of  the  relatively  high  reeietance  potysfiicon  row  access  fines.  A  512  x  512  pixel 
array  based  upon  the  original  176  x  176  device  has  been  designed.  The  primary  modification  has 
been  the  use  of  aluminium  to  replace  the  polysificon  row  access  fines  within  the  pixel  array.  The 
finished  design  has  been  fabricated  by  Austria  Mfcro  System  and  is  undergoing  electrical  testing. 

An  aftsmative  to  the  single  transistor  pixel  design  above  is  based  around  m  enhancement  of  the  six 
transistor  Static  RAM  (SRAM)  cel.  The  enhancement  involves  inserting  a  simple  logic  gate  between 
the  memory  and  the  mirror.  This  alows  some  of  the  addressing  requirements  of  the  FLC  to  be  met 
more  easiy.  The  SRAM  design  overcomes  si  of  tite  above  efisadvantages  of  the  DRAM  pixel  at  the 
expense  of  increased  trmoietor  count  leading  to  increased  pixel  area  and  decreased  chip  yield.  In 
particular  it  maintains  its  state  indefinitely,  allows  tire  use  of  the  fast-switching,  high  Ps,  FLC  materials 
and  shows  no  variation  of  contrast  ratio  with  incident  fight  intensity  over  a  wide  range  of  input 
intensity.  We  have  demonstrated  a  fully  working  256  x  256  pixel  array  built  in  1.2um  CMOS 
technology.  We  have  also  demonstrated  the  principle  of  grey  scale  on  this  binary  device  by  means  of 
liqnri  nuipvnng  m  toquiiM  names. 
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SILICON  FABRICATION  ISSUES 

We  have  suoceefufly  appfied  a  post-proceeaing  backplane  planarisation  technique  to  the  176  x  176 
DRAM  device  which  has  allowed  a  flat  mirror  to  ba  placed  on  top  of  tha  existing  circuitry.  The 
technique  involves  the  deposition  of  a  thick  dtoloctric  layer  which  is  subsequently  pofished  flat;  this 
slows  tie  deposition  of  a  further  metal  layer  vrtach  forms  a  flat  mirror  covering  almost  the  entire  pixsi. 
The  ntanarisalion  technioue  is  described  in  data!  oliewtinrn  141  LC  cel  construction  has  been  carried 
out  auccesfUfly  on  ptonarised  badqflanes.  The  increased  flat  fl  factor  of  the  backplane  increases  the 
ight  throughput  of  the  finished  SLM  and  reduces  tie  stray  light  reaching  toe  substrate. 

SUMMARY  AND  FORWARD  LOOK 

Table  1  summarises  the  current  situation,  high  resolution  devices,  based  on  both  DRAM  and  SRAM 
pixels,  have  been  demonstrated.  The  pixel  designs  lend  themselves  to  use  in  drfterent  appficabons 
areas.  The  planarisation  process,  which  significantly  enhances  device  performance,  has  been 
demonstrated  on  one  design,  it  is,  in  principle,  equafly  applicable  to  al  of  the  devices  of  Table  1. 
(dearly,  to  be  opticaly  useful,  the  512  DRAM  device  requires  to  be  planarised.) 

Given  the  current  (minimum  feature  size  and  maximum  die  size)  limitations  of  using  commercial 
sflioon  vendors  it  should  be  possible  to  pursue  the  DRAM  technology  to  1 024  x  1 024  and  the  SRAM 
to  512  x  512  with  frame  rates  comparable  to,  or  better  than,  those  of  Table  1.  Beyond  that  there  is  a 
Italy  need  for  access  to  specialised  memory  fabrication  processes. 


Device 

176  DRAM 

512  DRAM1 

16  SRAM2 * 4 

50  SRAM 

256  SRAM 

Date 

1999 

1994 

1996 

1968 

1994 

Sioon  PT0C688 

1 . 11  Ti  i"  ■ '  Ml . in’ll  »1T  'i . «'  1 

EjWKKKKKM 

30 

200 

72 

CZBiHHH 

235 

(not  square) 

145 

(not  square) 

110x110 

19x19 

26x26 

26x26 

196x196 

NA 

UlT"  73 M 

?WY*smm 

26/75 

16/75 

BM 

31 /NA 

23/81 

L1  v.iv:* if.i  T»ai 

250 

1000 

1 

NA 

85 

Frame  rate  (Hz) 

1000 

250*  ““1 

5* 

20» 

4000 

1  undotpoinQ  otoc  ironic  tstMny  at  ttmo  of  writ 

2  cunrenfly  undergoing  re-design  from  NMOi 

ing 

» to  CMOS 

*pMnflnzaDon  not  yw  oompMoa 

4  does  not  include  dedudiond  tor  oontact  or  via  holes 

5  predated  from  SPICE  simulations 

*  measured  using  slow  nematic  Squid  crystal 
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Abstract 

Two  kinds  of  optical  pattern  recognition  methods  based  on  multiple  correlations  are 
applied  to  actual  scenes.  The  ability  of  those  methods  to  locate  multiple  objects  is 
discussed.  The  results  of  computer  simulations  are  also  presented. 


Optical  system  based  on  a  optical  correlator  has  die  ability  to  perform  fast  pattern  recognition 
and  such  fast  operation  is  just  needed  in  machine  vision  system.  To  evaluate  the  ability  of  optical  system 
for  machine  vision,  we  have  attempted  to  apply  optical  multiple  correlation  methods  to  some  actual 
scenes. 

Pattern  recognition,  we  intend,  is  described  in  the  three  steps;  locating  all  multiple  objects  in 
the  input,  zooming  one  of  the  object  images  up  to  die  appropriate  size,  and  understanding  the  meaning 
of  zoomed  object  We  focus  here  on  achieving  die  first  step,  locating  objects,  by  optical  correlators, 
which  step  is  important  and  basic  for  other  steps. 

Two  procedures,  monochromatic  processing  and  color  processing,  were  applied  to  some 
actual  scenes  obtained  around  the  street  crossings.  These  test  scenes  have  individually  different  image 
complexities  in  their  backgrounds.  One  of  them  is  shown  in  Fig.l  (a).  In  these  scenes,  the  objects  are 
road  signs.  To  perform  shift-invariant  and  size-invariant  processing,  we  made  the  procedures  based  on 
multiple  correlation  by  using  the  well-known  correlation  filters  such  as  a  synthetic  discriminant  function 
(SDF)  filter1)  and  a  minimum  average  correlation  energy  (MACE)  filter^)  as  described  in  die  following. 
They  can  be  realized  optically  by  using  optical  multiple-correlator. 

1.  Monochromatic  Processing 

Both  of  SDF  filters  and  MACE  filters  have  been  applied  to  actual  scenes.  Tire  filters  are 
designed  to  discriminate  6  kinds  of  road  signs  by  using  33  training  patterns  and  to  catch  14  different 
sizes  of  objects.  Each  of  filters  catches  one  kind  of  road  sign  with  two  different  sizes.  The  processing 
procedure  is  described  as  follows:  the  input  image  is  firstly  correlated  with  all  filters  by  the  multiple- 
correlator;  the  correlation  patterns  are  binarized  to  pick  out  the  correlation  peaks  by  a  threshold  device; 
and  finally  they  are  added.  Bright  spots  in  the  final  image  denote  the  locations  of  objects. 

The  resnlt  of  computer  simulation  by  applying  MACE  filters  to  die  scene  of  Fig.l  (a)  shows 
that  all  time  objects  inside  the  scene  can  be  picked  up  but  many  false  signals  are  produced  by 
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background  image  such  ae  trees.  In  other  results,  all  objects  coukl  not  perfectly  delected  in  many  cases 
and  many  false  signals  appeared. 

2.  Color  Processing 

To  improve  the  above  procedure,  we  introduced  a  color  processing,  which  achieves  individual 
spatial  filtering  on  rad  (R),  green  (O),  blue  (B),  and  gray  (W)  components  of  original  polychromatic 
image.  We  choose  the  color  components  in  the  domain  of  cbromaticity  coordinates  as  a  vector  (2,  -1,  -1) 
for  R  component,  (-1,  2,  -1)  for  O,  (-1,  -1,  2)  for  B,  and  (1,1,1)  for  W.  It  can  be  realized  optically  to 
separate  an  input  image  to  the  color  components. 

In  this  processing,  the  filters  designed  for  each  coles’  component  of  object  are  applied  to  die 
corresponding  color  component  of  input  image  (component  processing)  in  the  same  manner  as  in  the 
monochromatic  processing.  Final  result  is  obtained  by  executing  AND  operation  on  all  results  from 
component  processings,  that  means  final  signals  denoting  die  object  locations  will  appear  only  when  all 
component  processing  detect  objects  in  the  same  locations  of  die  inputs. 

Figure  1  (b)  shows  the  result  of  computer  simulation  by  applying  this  processing  to  the  image 
of  Fig.l  (a).  The  SDF  filters  are  used  and  they  are  designed  in  die  same  condition  as  described  in 
monochromatic  processing  except  for  using  the  color  components  as  the  training  images.  Three  objects 
are  perfeedy  located  without  false  signals  in  this  case. 

Other  simulations  denotes  the  color  processing  yielded  always  good  results  compared  with  the 
case  of  monochromatic  processing.  But  die  color  processing  also  produced  false  signals  when  the 
background  of  input  image  has  heavy  complexities  and  it  is  a  large  drawback  to  be  overcome  now. 


(a)  (b) 

Rg.1  Results  of  computer  simulation  to  locate  the  road  signs:  (s)  shows  original  scene  and  (b)  is  the 
result  obtained  by  color  processing.  The  brightness  of  (b)  is  reversed. 


1)  D.Caaasunt,  "Unified  synthetic  discriminant  function  computational  formula,"  Appl.  Opt.  23, 
1620(1984). 

2)  A.Mahabmobts,  B.V.K.Vijaya  Kumar,  and  D.Caaasent,  "Minimum  avenge  correlation  energy 
filters,"  AppL  Opt  26, 3633(1987). 
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Abstract 

We  propose  a  new  algorithm  to  design  multiple-object  discriminant  correlation  filters. 

This  technique  rr  1  »s  it  possible  to  control  the  sidelobe  levels  of  the  filter  with 

considering  die  nic  range  of  recording  medium. 

Summary 

The  technique  of  using  optical  correlators  for  optical  pattern  recognition  has  been  well 
studied.  To  obtain  high  performance  of  recognition,  it  is  necessary  to  use  optimum 
correlation  filters  suitable  for  an  object  of  recognition.  We  studied  an  algorithm  to  design 
correlation  filters  for  pattern  recognition  in  the  presence  of  distortions  or  for  multiple-object 
pattern  discrimination.  As  this  type  of  correlation  filters,  the  Synthetic  Discriminant 
Function1  (SDF)  filters  are  well  known.  The  main  feature  of  SDF  is  that  we  can  specify  the 
correlation  values  for  training  patterns  used  to  calculate  the  function.  Some  filters,  which 
can  be  regarded  as  modifications  of  SDF,  are  proposed  to  suppress  sidelobes  and  produce 
sharp  correlation  peaks. 

However,  the  output  correlation  values  for  the  sidelobe-suppress  filters  are  very 
sensitive  to  distortion  of  input  patterns.  The  request  for  sharpness  of  correlation  peaks  and 
the  distortion  invariance  may  conflict.  Therefore  we  should  be  paid  attention  to  the 
equilibrium  of  diem. 

In  addition,  the  filters  have  wide  range  and  gentle  gradation,  in  general.  So,  it  is 
difficult  to  record  the  filter  function  correctly  on  a  medium.  The  incorrect  recording  may 
detract  from  die  correct  discrimination. 

To  solve  these  problems,  we  have  tried  to  calculate  the  correlation  filter  functions 
parametrically  by  using  the  optimization  algorithm.  We  set  up  a  cost  function,  which  is  a 
weighted  summation  of  some  feature  values  calculated  from  the  filter  function.  As  die 
feature  values,  we  use  the  correlation  energy,  squared  error  between  correlation  value  and 
specified  one  at  the  origin,  and  variance  of  filter  function  values.  We  regard  these  values  as 
the  sidelobe  level,  the  accuracy  of  specification  of  correlation  values,  and  die  tolerance  for 
distortion  of  input  pattern,  respectively.  By  adjusting  the  weight  coefficients,  it  can  be 
possible  to  control  the  characteristics  of  filter.  The  dynamic  range  of  recording  medium  is 
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used  as  the  constraint  in  optimization.  By  using  an  optimization  algorithm,  the  filter 
function  which  gives  the  minimum  value  erf  the  cost  function  can  be  obtained. 

As  the  optimization  algorithm,  we  have  introduced  the  Simulated  Annealing  (SA) 
algorithm.  We  studied  the  case  of  recording  multiple-object  discriminant  correlation  filters 
as  Lohmann  type  Computer  Generated  Holograms  (CGHs)  with  discrete  values  of  amplitude 
and  phase  components. 

A  computer  simulation  was  performed  to  estimate  die  performance  of  die  filter.  We 
compared  the  filter  to  some  other  filters,  such  as  SDF,  die  Minimum  Average  Correlation 
Energy2  (MACE)  filters,  and  their  phase-only  versions3.  Figure  1  shows  the  example  of 
simulation  results.  Figure  1  (a)  is  the  cross-correlation  pattern  of  die  filter  calculated  by  SA 
with  a  priority  of  suppressing  sidelobes.  The  amplitude  and  phase  components  of  filters  are 
represented  with  5  and  4  levels,  respectively.  It  shows  that  the  sharpness  of  die  peak  is 
excellent  Figure  1  (b)  is  of  the  MACE  filter  which  components  are  quantized  to  the  same 
levels.  Computer  simulation  results  shows  that  the  algorithm  makes  it  possible  to  obtain 
multiple-object  discriminant  correlation  filters,  which  gives  expected  correlation  response, 
on  a  discrete  type  recording  medium.  In  addition,  the  correlation  responses  erf  the  filters  can 
be  controlled  parametrically. 


(a)  (b) 

Figure  1 .  Typical  correlation  plane  for  (a)  a  filter  by  SA  (b)  a  quantized  MACE  filter. 
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An  implementation  of  a  Vander-Lugt  correlator,  which  operates  with  a  single  spatial  light 
modulator  is  proposed.  Optical  phase-retrieval  manipulation,  based  on  the  simmetrization  of  the 
information,  is  required.  Some  experimental  results  are  also  presented. 

In  recent  years,  some  authors  have  proposed  the  same  compact  correlation 
architecture1'3  for  different  purposes.  These  setups  are  based  on  the  use  of  a  spatial  light 
modulator  (SLM)  to  introduce  and  display  information;  a  Fourier  lens  system,  a  video-camera 
(CCD)  to  register  light  distributions  and  a  computer  that  controls  the  whole  system  (see  Figure 
1).  If  the  setup  performs  as  a  joint  transform  correlator,  both  the  scene  and  the  reference  are 
displayed  on  the  SLM  and  the  CCD  in  the  Fourier  plane  registers  the  joint  interferences 
between  them.  This  power  spectrum  is  processed  by  a  computer  and  displayed  again  on  the 
same  SLM.  A  second  optical  Fourier  transform  is  obtained  and  the  correlation  is  recorded  by 
the  CCD. 

Kotzer  et  al.  proposed  in  1991  the  use  of  the 
same  architecture  to  implement  Vander-Lugt 
correlators.  The  scene  is  shown  on  the 
modulator  and  its  power  spectrum  is 
recorded  by  the  CCD.  To  recover  the  phase, 
they  obtained  the  joint  interferences  between 
the  object  and  a  plane  wave.  By  processing 
these  light  distributions  the  phase  can  be 
retrieved.  In  a  second  step,  the  reference  is 
handled  in  the  same  way.  As  a  consequence, 
the  correlation  between  the  scene  and  the 
target  is  obtained. 

In  this  communication  we  present  an 
alternative  to  Kotzer's  idea  that  simplifies  the  experimental  procedure  without  lack  of 
recognition  capability.  The  idea  of  retrieving  the  phase  lost  in  the  recording  process  is  based 
on  the  symmetrization  of  die  images  used  (see  Figures  2a  and  2b).  The  images  should  be 
placed  as  close  as  possible,  without  superposition,  in  order  to  take  advantage  of  the  spatial 
bandwidth  of  the  modulator.  Let  s(x,y)  be  the  scene,  and  let  |S(u,v)|exp(i<|>(u,v))  be  its 
Fourier  transform;  a  is  the  separation  between  the  original  image  and  its  symetric  and  f  is  the 
focal  of  the  lens.  In  the  conditions  of  Figure  2a,  we  obtain  in  the  Fourier  plane  the  following 
distribution: 

Sf(u,v)=|  |  S(u,v)  |  exp(i<Ku,v))exp(-i27cau/Xf)  +  |S(u,v)|exp(-i4>(u,v)  |2  (la) 
We  present  two  alternatives  for  performing  pattern  recognition.  The  tint  method  consists  in 
registering  the  power  spectrum  |  S(u,v)  | 2  and  then  computing  digitally  the  subtraction  Sp(u,v)- 
2|S(u,v)|2.  Finally,  the  result  is  binarized,  which  is  equivalent  in  a  first  approximation  to 
S»(u,v)=cos(2^-27au/Xf).  The  reference  r(x,y)  can  be  processed  in  a  similar  way  and  in  the 
Fourier  plane,  the  light  distribution  can  be  written  as 

Rp(u,v)=  |  |R(u,v)|exp(i0(u,v))  +  |R(u,v)|exp(-i0(u,v)exp(-i2jtauAf)  |2  (lb) 
where  |R(u,v)|exp(i0(u,v))  is  die  Fourier  transform  of  r(x,y).  Recording  |R(u,v)|2, 
computing  Rp(u,v)  -  2 1 R(u,v)  | 2  and  binarizing  we  obtain  R*(u,v)  =cos(20 +2rcau/Xf).  Finally, 
the  product  S*(u,v)R*(u,v)  is  obtained, 
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Sfc(u,v)R*(u,v)*2cos2($+0)  +  2cos2($-8+2nau/Xf)  (2) 

If  ♦-©■•O,  die  second  cosinus  will  perform  two  Delta-distributions  in  the  correlation  plane  at 
points  x=2a  and  x=-2a. 

A  further  simplification  (method  #2)  can  be  obtained  by  binarizing  the  intensities  (la) 
and  (lb)  considering  as  threshold  the  median  values  of  Sp(u,v)  and  Rp(u,v)  and  then  computing 
the  product  of  these  binary  distributions. 

Figures  2a  and  2b  show  the  scene  and  the  reference  used  as  a  test.  Figures  3  and  4 
show  the  optical  detection  of  the  reference  using  the  architecture  described  in  Figure  1 .  Figure 
3  has  been  obtained  following  the  first  method  and  the  correlation  of  Figure  4  corresponds  to 
binarize  equations  (la)  and  (lb)  using  the  median  value  of  these  distributions. 

As  a  conclusion,  the  use  of  single  SLM  architectures  allow  the  simultaneous 
implementation  of  Vander-Lugt  and  joint  transform  correlators  with  similar  recognition 
capabilities. 


Figure  3:  Optical  correlation  (method  f  1). 


Figure  4:  Optical  correlation  (method  #2). 


Tine  paper  has  been  upportcd  in  pert  by  die  Spanish  CICYT  (Comisidn  interministerial  de  Cieocia  y  Tecnologb) 
project  No.  ROB91-0554. 
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ABSTRACT 

A  joint  transform  correlator  is  described  whose  input  is  provided  by  two  acousto-optic  devices. 
It  is  shown  that  die  system  is  capable  of  detecting  and  direction  finding  signals  which  are  more 
than  10  dB  below  die  noise  level. 


SUMMARY 

A  schematic  diagram  of  the  system  is  shown  in  Figure  l.  It  is  a  standard  joint  transform 
correlator  [1]  in  which  the  inputs  are  provided  by  two  acousto-optic  (AO)  cells  and  die  square 
law  detector  is  a  spatial  light  modulator  such  as  a  liquid  crystal  light  valve  (LCLV).  The  lenses 
and  spatial  filter  required  to  extract  the  first  order  light  from  the  AO  cells  have  been  omitted 
for  clarity. 


LI 


Figure  1 .  The  acousto-optic  joint  transform  correlator  -  schematic. 


MP4/36 


Taking  the  first  order  diffracted  tight  from  the  AO  cells  as  the  input  to  the  system  the  input 
function  is  of  the  form 

o(x,t)  =  f(x-a-Vt)  +  g(x  -hi  — Vt)  (1) 

where  f(t)  and  g(t)  are  the  signals  applied  to  the  AO  cells,  V  is  the  acoustic  velocity  in  the  cells 
and  the  separation  of  the  cells  is  2a.  It  is  straightforward  to  show  that  if  the  Fourier  transforms 
of  f(x)  and  g(x)  are  written  as  |F(u)|exp{j?(u)}  and  |G(u)|exp{jd(u)}  respectively  then  the 
response  of  die  square  law  detector  at  a  given  instant  in  time  will  be 

p(nf  =  |F(u)J2  4|G(u)|2  -+jF(u)|  •|G(u))cos{2Ua  +v<u)  u)}  (2) 

Since  die  two  signal  are  propagating  at  the  same  acoustic  velocity  the  extra  phase  term  due  to 
the  terms  Vt  in  equation  (1)  will  cancel  out  The  final  term  in  equation  (2)  represents  a  set  of 
sinusoidal  hinges  of  spatial  frequency  2a,  amplitude  modulated  by  the  amplitude  of  the  cross 
spectral  density  (CSD)  of  f(x)  and  g(x)  and  phase  modulated  by  the  phase  of  the  CSD.  Taking 
the  Fourier  transform  of  this  fringe  pattern  optically  from  the  read  side  of  the  spatial  tight 
modulator  output  enables  us  to  obtain  the  cross  correlation  function  of  f(t)  and  g(t).  The 
performance  of  the  system  has  been  modelled  on  the  assumption  that  the  inputs  to  the  AO  cells 
are  horn  two  independent  receivers,  a  distance  D  apart,  that  receive  the  same  signal  but  the 
noise  in  the  two  receivers  is  uncorrelated.  Figure  2  shows  a  sample  cross  correlation  output 
from  die  model.  The  time  window  of  the  AO  cell  was  SO  ps,  and  the  signal  tested  was  a  chirp 
of  length  100  ps,  with  a  centre  frequency  of  10  MHz  and  a  bandwidth  of  10  MHz.  The  noise 
bandwidth  was  20  MHz  and  the  signal  to  noise  ratio  was  -12.5  dB.  The  cross  correlation 
function  was  computed  after  integrating  the  cross  spectral  density  on  the  spatial  tight  modulator 
for  100  fis.  The  position  of  the  correlation  peak  gives  a  measure  of  the  time  difference  of 
arrival  of  die  signal  at  die  two  receivers  so  that  direction  finding  is  possible  as  well  as 
detection.  The  performance  of  die  system  is  examined  in  detail  and  results  are  presented  from  a 
practical  system. 
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Abstract 

Both  a  theoretical  analysis  and  practical  results  are  presented  for  a  non-heterodyning  acousto-optic 
correlator  that  uses  the  zeroth  diffraction  order  to  produce  a  true  correlation  function  containing  both 
amplitude  and  phase  information. 


Summary 

The  structure  of  the  zeroth  order  non  heterodyning  space  integrating  acousto-optic  correlator  is  as 
shown  in  figure  1. 


Spatial 


Figure  1.  Physical  structure  of  zeroth  order  space  integrating  acousto-optic  correlator 


The  basic  architecture  is  similar  to  many  correlators  previously  described  in  die  literature  [1,2],  but 
there  are  two  important  differences.  The  spatial  filters  select  all  or  part  of  die  zeroth  diffraction  order. 
Most  correlators  use  the  first  order  and  some  papers  assume  that  the  zeroth  order  contains  no 
information.  In  reality,  die  zeroth  order  contains  as  much  information  as  all  the  other  orders  put 
together,  but  it  can  be  difficult  to  access  due  to  large  optical  power  incident  on  a  photodetector  causing 
saturation  or  excessive  shot  noise.  However,  it  can  be  shown  that  the  zeroth  diffraction  order  has  sub 
orders  that  can  be  used  very  effectively  [3].  The  second  important  difference  lies  in  die  way  the  signals 
are  introduced  into  die  acousto-optic  cells.  A  common  technique  is  to  insert  the  signals  as  double 
sideband  suppressed  carrier  modulation  of  a  carrier  at  die  centre  frequency  of  die  acousto-optic  cell. 
The  modulation  used  best  is  double  sideband  large  carrier,  the  difference  is  crucial. 

The  operation  of  this  correlator  is  as  follows.  One  of  die  two  signals  to  be  correlated  is  inserted  into 
die  first  acousto-optic  cell  as  double  sideband  large  earner  modulation  of  a  carrier  (For  the  Te02  cells 
used  in  die  correlator  built  by  the  authors  the  carrier  frequency  is  45  MHz).  At  the  optical  output  of  the 
first  acousto-optic  cell  the  optical  wavefront  is  phase  modulated  by  die  acoustic  wave  in  die 
acousto-optic  ceil.  The  Fourier  transform  of  this  wave  function  is  formed  at  die  first  spatial  filter  which 
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passes  all  of  the  zeroth  diffraction  order  from  the  first  cell  onto  the  second  acousto-optic  cell.  The 
combination  of  the  spatial  reversal  of  the  first  signal  through  the  imaging  process  and  time  reversal  of 
the  second  signal  before  injection  into  the  second  acousto-optic  cell  leads  to  the  counter  propagation  of 
die  signals  required  for  correlation.  The  final  lens  forms  the  Fourier  transform  of  the  output  from  die 
second  acousto-optic  cell  at  die  final  spatial  filter  which  selects  out  the  first  sub  order  of  the  zeroth 
diffraction  order.  It  can  be  shown  [4]  that  the  photodetector,  acting  as  a  spatially  integrating  square  law 
detector,  then  forms  a  true  correlation  function,  containing  both  amplitude  and  phase  information,  which 
can  be  fed  through  a  simple  highpass  filter  directly  to  (e.g.)  an  oscilloscope. 


The  autocorrelation  function  for  a  linear  chirp  pulse,  sweeping  from  4.25  MHz  to  5.75  MHz  in  15  ps 
(BT  -  22.5)  using  the  correlator  shown  in  figure  1,  is  shown  in  figure  2. 


Figure  2.  Autocorrelation  of  a  linear  chirp  pulse 
produced  by  the  correlator  shown  in  figure  1. 
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Abstract 

We  show  that  edge  enhancement  Is  inherent  to  the  photorefraetive  recording.  Its  effect  in 
Joint  Transform  Optical  Correlators  is  demonstrated  and  a  computational  model,  valid  for 
most  input  images,  is  given . 


Summary 

The  improving  availability  of  liquid  crystal  Spatial  Light  Modulators,  and  sensitive 
photorefraetive  materials  makes  possible  the  implementation  of  effective  optical  correlators 
using  low  energy  laser  sources.  The  Joint  Transform  Correlator  (JTC)  is  well  adapted  to 
applications  where  the  size  of  the  correlator  is  an  issue  |1J.  The  purpose  of  this  paper  is  to 
demonstrate  some  specific  properties  introduced  by  the  response  of  the  photorefraetive 
crystal  used  in  the  Fourier  plane,  and  to  present  a  model  that  predicts  their  response  in 
those  particular  experimental  conditions. 

The  geometry  of  our  experimental  set-up  is  directly  derived  from  that  described  by 
Rajbenbach  &  al  1*1.  The  object  to  be  recognized  and  the  scene  where  it  is  expected  are 
Introduced  in  the  input  plane  using  a  TFT  nematic  liquid  crystal  spatial  light  modulator 
illuminated  by  the  collimated  beam  of  a  diode  pumped  doubled  YAG  laser.  Their  Fourier 
transforms  o  and  s  are  formed  by  the  same  lens.  The  resulting  distribution  of  Intensity  is 
recorded  as  a  modulation  of  refractive  index  by  a  BSO  crystal.  It  is  observed  using  the  670 
nm  beam  of  a  laser  diode  incident  from  the  back  at  an  angle  adjusted  to  fulfill  the  Bragg 
condition.  At  that  wavelength,  the  sensitivity  of  the  BSO  crystal  is  considered  as  negligible 
compared  to  that  at  the  writing  wavelength  (532  nm),  so  that  reading  and  -  tting  may  take 
place  simultaneously  .  The  Fourier  transform  of  the  amplitude  distribution  diffracted  by  the 
photorefraetive  grating  is  observed  in  real  time  In  the  correlation  plane  Pc.  using  a  CCD 
camera. 

Conventional  models  describe  the  build-up  of  the  photorefraetive  effect  under  illumination 
by  a  one  dimensional  sinusoidal  interference  pattern  of  average  illumination  Jo .  modulation 
depth  m  and  wave  vector  k.  Assuming  a  uniform  Io  and  a  weak  m  a  space  charge  field  is 
predicted  whose  steady  state  value  Esc  does  not  depend  on  Io  and  is  proportional  to  m  . 
Through  linear  electrooptic  effect,  an  index  grating  is  formed  with  a  modulation  proportional 
to  Esc-  For  objects  much  smaller  that  their  separation,  the  model  still  applies  to  the  joint 
transform  correlator.  At  steady  state,  the  induced  index  modulation  and  the  diffracted 


amplitude  are  found  to  be  proportional  not  to  o(r')s*(r')  as  expected  from  a  linear 
holographic  recording  but  to  m(r*  )=  o(r*  )s  *  +|o(r'  +|j(r')|2  j  where  the  term  Id 

describes  the  effect  of  the  crystal's  dark  conductivity  and  stray  light.  71115  non  linearity  Is 
typical  of  the  Joint  Transform  Correlator.  In  a  Vander  Lugt  correlator  one  of  the  terms 
would  describes  the  reference  plane  wave.  The  value  of  Its  fluence  would  be  chosen  either 
to  restore  the  linearity  of  the  correlator  or  to  cause  edge  enhancement!3!.  This  non  linearity 
is  Illustrated  by  the  correlation  of  two  Identical  slits.  The  shape  of  the  object  only  affects  the 
local  Intensity  lo  that  only  determines  the  speed  of  the  effect,  while  m  equals  unity  all  over 
the  crystal.  A  quasi-uniform  diffraction  efficiency  is  then  expected,  that  generates  a  sharp 
correlation  peak  located  at  the  maximum  of  the  correlation  function  and  periodical  side 
peaks  caused  by  the  zeros  of  the  sine2  function.  This  non  linear  response  causes 
amplification  of  the  spatial  frequencies  of  weak  amplitude  in  the  objects.  In  most  cases,  this 
gives  rise  to  edge  enhancement.  However  It  Is  moderated  by  noise  and  the  reading  beam. 
Experimental  evidences  will  be  presented. 

The  model  for  the  photorefractlve  effect  fain*  when  the  spatial  distribution  of  illumination 
varies  rapidly  on  the  crystal  or  is  two-dimensional.  It  also  falls  to  describe  the  case  where 

the  modulation  of  one  grating  locally 

Autocorrelation  of  a  disk  t  ' 

becomes  close  to  unity.  A  "whole  beam 

method"  has  been  recently  proposed  I4!. 
ri  Jar  M  __  ..  We  show  that  it  provides  a  simple 

ill analytical  steady-state  solution  of  the 

Photorefractlve  equations  in  the  experimental  conditions 
HPpl'PBlSMifWMK  correlator  of  the  Joint  Fourier  Transform  correlator. 

Jit  fully  describes  the  edge  enhancement 
in  the  correlator,  including  the  effect  of 
correlator  the  rea(jlng  t>cam  which  affects  the 

.  w  .  width  of  the  correlation  peaks.  It 

_  ^  *  provides  an  accurate  simulation  of  the 

correlator  that  will  be  illustrated. 

These  results  are  part  of  a  contribution  to  the  NAOPIA II  Project,  funded  by  the  Commission 
of  the  European  Community  under  the  ESPRIT  Programm  for  Research  and  Development. 

References 

[1]  B.  Loiseaux,  G.  Ollaquer,  J.P.  Huignard,  Optical  Engineering,  Vol  24,  n°l,  pp.144- 
149,  (1935). 

(2)  H.  Rajbenbach,  S.  Bann,  P.  Ftefrtgier,  P.  Joffre,  J.-P.  Huignard.  H.-S.  Buchkremer, 
A.S.  Jensen,  E.  Rasmussen.  K.H.  Brenner.  G.  Lohman,  Appl.  Opt.Vol  3l,n°26. 
pp. 5666-5674,  (1992). 

(31  J.  Felnberg.  Optics  Letters,  Vol  5,  N*8.  pp.330-332,  (1980) 

(4]  M.  Cronln-Golomb ,  Optics  Communications  89  ,  pp.  276-282,  (1992) 


Prediction 


Photorefractlve 

correlator 


Linear 

correlator 


MP7/41 


Optical  matrix  multiplication  and  its  application  in  pattern  recognition 

Yansong  Chen,  Dehua  Li,  Shihai  Zheng,  Yuhe  Zhang,  Chengxinag  Li 

Institute  of  Physics,  Chinese  Academy  of  Sciences 
Beijing  100080,  China 
Tel:  86-1-2559131x257,  Fax:  86-1-2562605 

Abstract 

A  4/-type  optical  system  for  matrix  multiplication  is  presented,  and  with  the  system  the 
hybrid  optical  image  processor  is  constructed  for  recognizing  Roman  letters,  Arabic  letters 
and  some  airplanes.  The  experiments  are  successful  in  the  case  of  above  input  images 
undergoing  shift  and  rotation. 


Summary 


Optical  matrix  multiplication  is  a  basic  optical  computing  operation,  which  is  widely 
utilized  in  optical  image  processing  and  artificial  neural  network.  In  this  paper,  based  on  the 
study  of  optical  general  transformation,  a  4/-type  optical  system  for  matrix  multiplication  is 
presented  and  with  this  system  a  hybrid  optical  image  processor  is  designed  and  set  up  for 
pattern  recognition.  The  system,  shown  in  Figure  1,  is  consisting  of  two  Fourier  lenses  and  a 
holographic  mask,  by  which  the  matrix  inner-product  multiplication  proceeds  in  a  nongearing 
manner  and  many  matrix  multiplication  can  be  achieved  by  connecting  several  such  systems 
in  series.  As  an  example,  the  inner  products  of  4x8  and  8x4  matrices  are  calculated,  yielding 
results  that  are  in  agreement  with  theoretical  values. 

A  hybrid  optical  processor  constructed  by  the  matrix  multiplicator  is  designed  for 
computing  invariant  moments  of  images  in  real  time.  Taking  six  capital  Roman  letters  (C,  F, 
G,  H,  J,  L)  as  the  input  image  the  processor  is  tested  for  recognizing  them.  The  experimental 
results  show  that  a  letter  is  of  approximate  values  of  invariant  moment  for  its  undergoing 
translation  and  rotation,  and  the  moments  of  different  letter  are  distinct  enough  for  one  to 
recognize  them.  Therefore  an  input  letter  can  be  recognized  by  extracting  its  invariant 
moments  with  the  processor  even  if  the  letter  taking  shift  and  rotation. 

With  the  matrix  multiplicator  the  hybrid  character  recognition  system  is  also  designed 
and  set  up,  which  is  composed  of  two  parts,  the  image  feature  extractor  and  the  inner-product 
correlator.  The  feature  extractor  draws  32  bits'  information  forming  a  vector  from  an  input 
image  and  the  correlator  carries  out  inner-product  of  the  vector  with  storage  memory  matrix. 
Ten  Arabic  figures  from  0  to  9  and  eight  various  airplanes  as  the  input  images  are  tested  by 
the  system.  Hie  experimental  results  show  that,  for  printing  Arabic  figures  and  the  eight 
airplanes,  that  are  permitted  to  rotate  an  angle  of  15°,  the  recognized  ration  reaches  100%,  and 
for  hand-writing  Arabic  figures  about  70%  of  their  recognition  may  be  reached. 
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We  described  the  experimental  results  about  construction 
and  application  the  precision  optical  correlator  for  analysis 
of  large  format  Images  with  the  computer  processing  of  output 
fields  for  fast  ana  precess  determination  of  correlation  ma¬ 
xima  coordinates. 

For  the  structural  analysis  of  highly  informative  images, 
for  example,  astro-  and  aerial  photography,  we  effectively 
use  optical  correlators  /l/.  For  the  solution  of  the  task  in 
precess  determination  of  coordinates  of  correlation  maxima 
for  the  analysis  of  large  format  images  (180*180  mm)  we  have 
developed  precision  coherent  optical  correlator  according  to 
the  scheme  of  Wander  Lugt  on  the  basis  of  optical  Fourier 
processor  with  aperture  500  mm  and  resolution  to  100  lin/mm. 
holographic  registrator  type  HRS  1001  with  the  working  area 
40*40  mm  and  resolution  to  1500  lln/mm,  scanning  photorecei¬ 
ving  units  Elkonix  1000  (Kodak  Co.)  with  the  resolution  of 
elements  4096*4096.  conjugated  with  the  computer  IBM  PC  AT. 

The  output  image  of  the  correlation  function,  formed  by 
the  Fourier  processor,  is  read  by  the  scanning  unit,  digiti¬ 
zed  with  the  gradation  number  256  and  put  into  the  computer 
for  the  digital  processing.  Realized  so.  algorithms  permit  to 
automatically  determine  the  location  of  correlation  peaks, 
increasing  given  by  the  operator  threshold,  independently  of 
their  dimensions  and  form  and  with  regard  for  unevennes  of 
the  background,  on  which  images  are  optically  formed.  Algo¬ 
rithms  are  optimized  according  to  fast  activity  and  permit  to 
analyse  one  correlation  field  with  information  capacity 
4096*4096*8  bit  for  the  time  during  five  minutes  and  writing 
results  into  the  file  and  the  possibility  of  depicting  the 
map  of  location  of  detected  correlation  maxima  and  fragments 
of  read  image  in  pseudocolours  on  the  monitor  of  the  compu- 


In  the  report  we  describe  the  peculiarities  of  mounting, 
methods  of  conducting  Investigations,  and  also  we  give  expe¬ 
rimentally  received  estimates  of  maximum  possibilities  for 
the  analysis  of  large  format  highly  informative  Images.  The 
usage  of  the  described  automated  correlator  permits  to  signi¬ 
ficantly  shorten  the  time  of  analysis  of  Images  and  ensure 
high  preclsslon  and  reliability  of  received  results. 

We  shall  presents  also  the  results  about  using  of  2-D 
thin  field  optical  bistable  devices  for  the  analysis  of  cohe¬ 
rent  Images,  which  forms  by  optical  correlator  with  very  high 
speed.  The  unit,  which  make  this  analysis,  consists  of  one 
2-D  optical  bistable  device,  two  multielement  linear  photo¬ 
sensors  and  one  cubic  beamspletter  /2/.  This  unit  provided  a 
good  signal/noise  relation.  The  2-D  optical  bistable  devices 
had  30  mm  diameter  and  made  In  the  Devision  of  Optical  Prob¬ 
lem  of  Informatics  (Academy  of  Sciences  of  Belarus). 
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Abstract: 

A  new  architecture  for  optical  coordaate  transformation  with  two  phase  filters  employed  is 
proposed,  which  is  described  by  the  Wigner  distribution  fraction.  A  precise  theory  and 
experimental  results  are  presented. 

Many  attempts  for  optical  coordinate  transformation  are  based  on  tne  coherent  optical  Fourier 

transformation.1-3  TO  propose  here  an  alternative  approach,  in  which  two  phase-only  filters  are  used  and  the  input  plane 
and  tile  output  plane  are  conjugate  each  other,  m  shown  in  Fig.  1.  IWo  phase-only  filters  exp{  <&,(r>]  and  exp{  02(r)]  are 
positioned  m  a  distance  of  z  from  the  input  plane  and  at  the  Fourier  transform  plane  of  a  lens  L,,  respectively.  The 
Fourier  transform  of  tbe  first  filter  is  obtained  in  the  second  filter  plane.  The  final  output  is  the  Fourier  transform  of  die 
transmitted  light  from  the  second  filter.  In  toms  of  paraxial  approximation,  the  optical  system  is  generally  described  by 
tbe  following  double  Wigner  distribution  fraction,-4 

*<'.  A)  I*  ['/t  •  'Sf  Hk  ‘  )]  (1) 

where  hff\;r0)  denotes  the  point  spread  function  of  the  system.  Equation  (1)  gives  a  relationship  between  heights  r  and 
angles  a  in  the  input  and  the  output  planes,  hi  the  particular  case  of  Fig.  1,  the  optical  system  is  represented  by 

Xjffjjfif  ^  r.  ’  ‘  z  a‘  ^  ^  r<"  r.  561  a>'  "*«“  k~dr^~*dr,da‘ 

«  r,  -fa;  )  8 (  a,  +  ^  )dr, da,«  ry  - r,)5(  a;  -a,- Hrfla, 

8(r  +  /a,  )B(a.  -  ^  Jdr/d a,  ® 


Gr,.a )  (r^,) 


exptd>,(r)  ] 


expt«y[r)l 


Fig.  1  Optical  system  for  coordinate  transform. 


Suppose  tbe  phase  of  the  second  filter  is  written  by  th.  following  form; 


«*PC®WJ»«p{( 


-&)Js(^)dr,] 


TO  have  the  double  Wigner  distribution  fraction  written  by; 
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K  (  r,  *,,,  )- Ur,-  -  r,  >««,-  «,  - 1  1««  r;  -  rt )  8U,  ^ )) 


r.mS(rt) 

Then  we  have  final  phase  distributions  of  filters  as  follows; 

^d')  l*,M 

As  an  typical  example,  we  consider  the  following  coordinate  transformation; 
g.Or,,?,  latitats,) 

The  phase  distributions  of  the  filters  an  given  by 

0  (x  y  >  „  .  k.  [*1 .  *1  +  qx  +( l  - 1)  y  *  1+c 
1  ,y  Z  2  Si  % 


4>ttx  ,y  )■  - ^  InlCt  )ln(^  1+2^ 

*ScV‘*i 2 3 4V*f  •l>'<l>,,’4c' 


(4) 


(5) 


(6) 


C7) 


(8) 

(?) 

(10) 

(ID 


Vfe  have  made  the  phase-only  fibers  with  the  computer-generated  hologram  technique.  Figure  2  shows  an 
input  pattens  example  (a)  of  file  grating  whose  pitches  are  exponentially  changed.  Because  of  the  logarithmic  coordinate 
transformation  system,  we  have  the  output  (b)  of  an  eqni-spaced  grating  image. 


mini 

mini 

limy 


nim 

‘  ’’if? 

;Ti  u  i 


Pig.  2  Logarithmic  coordinate  transformation  (a)  Input  image,  (b)  output  image. 


have  proposed  anew  architecture  of  the  optical  coordinate  transformation  using  a  pair  of  phase-only  filters 
and  derived  the  analytical  solution  of  foe  films.  Optical  experiments  with  computer-generated  filters  verifies  the 
analytical  rotation  for  logarithmic  coordinate  transformation.  Features  of  the  architecture  are  (fiscussed  in  comparison 
with  Bryugdahl  Fourier  transform  method. 
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Abstract 

A  new  means  of  breaking  symmetry  in  binary  phase  only  matched  filters  is  reported. 
A  randomly  pixellated  diffractive  element  is  combined  with  a  dynamic  binary  filter  to 
produce  a  pseudo  four-level  phase  matched  filter. 


The  Binary  Phase  Only  Matched  filter  (BPOMF)  has  been  well  researched  to  ensure  such 
features  as  invariance  robustness.  A  much  less  publicised  feature  of  the  BPOMF  is  the 
inherent  rotational  symmetry  due  to  the  binary  phase  levels  of  0  and  x.  The  symmetry  means 
that  when  an  object  appears  either  correctly  or  rotated  by  180°  in  the  input,  the  corresponding 
correlation  peaks  will  be  identical.  In  an  application  such  as  road  sign  recognition,  this  is  an 
undesirable  property,  as  triangular  signs  have  very  different  implications  for  each  orientation, 
hence  an  asymmetric  BPOMF  is  desirable.  Symmetry  can  be  broken  by  increasing  the 
number  of  phase  levels,  but  this  requires  another  binary  Spatial  Light  Modulator  (SLM) 
for  the  filterfl].  We  suggest  a  system  which  produces  a  pseudo  four  level  phase  system,  not 
total  quaternary  phase,  but  sufficiently  flexible  to  break  the  BPOMF  symmetry. 

The  correlator  is  the  classical  4/  Vaader  Lugt  BPOMF  constructed  using  binary  ferroelectric 
liquid  crystal  SLMs  as  the  modulation  devices.  Included  with  the  second  SLM  (in  binary 
phase  mode)  is  a  pixellated  diffractive  dement  (random  phase  array)  to  create  the  four  level 
system.  The  random  phase  array  has  the  same  pixel  pitch  as  the  filter  SLM  and  is  aligned 
pixel  to  pixel  with  the  SLM.  Each  pixel  on  the  random  phase  array  is  set  to  either  0  or  x/2 
and  is  fixed.  Ideally,  the  phase  array  would  be  physically  etched  onto  the  glass  surface  of 
the  SLM,  but  for  initial  demonstration,  we  have  constructed  a  photoresist  pattern  on  a  A/10 
optical  fiat  to  generate  the  x/2  phase  step. 

The  SLMs  used  are  128x128  pixel  devices  from  THORN  EMI  CRL.  The  pixel  pitch  is  220/um, 
so  a  chrome  on  glass  mask  was  made  containing  the  random  phase  array  pattern.  The 
initial  random  phase  array  was  made  by  spinning,  exposing  and  developing  Shipley  S1400-31 
photoresist  onto  a  A/10  glass  optical  fiat,  but  this  gave  too  large  a  step  in  the  photoresist. 
Shipley  S1400-17  photoresist  was  then  tried  giving  a  step  size  of  320nm  (247nm  required  for 
x/2  at  A  as  633nm).  However,  if  the  wavelength  of  780nm  is  used,  then  the  phase  step  will 
be  1.65rad  which  is  dose  enough  to  x/2. 
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Figure  1:  Simulation  of  the  pseudo  four  phase  level  system  applied  to  a  triangular  road  sign 


Extensive  simulation  of  the  effects  of  the  random  phase  array  have  been  completed.  Figure 
(1)  shows  the  effect  of  the  random  phase  array  (x/2  delay)  for  a  triangular  road  sign.  The 
BPOMF  was  generated  by  a  simulated  annealing  algorithm[2]  which  was  adapted  to  include 
the  random  phase  array  in  the  calculation.  The  suppression  of  the  symmetric  order  was 
16dB  and  there  was  no  increase  in  signal  to  noise  ratio  (26.54B).  For  the  constructed  random 
phase  array  with  a  delay  of  1.65rad,  the  symmetric  suppression  was  reduced  to  15.1dB  but 
the  signal  to  noise  ratio  remained  at  26-5dB. 


We  have  presented  a  new  means  of  suppressing  the  symmetric  correlation  peaks  in  the 
BPOMF  without  requiring  an  extra  SLM  or  penalising  the  signal  to  noise  ratio  of  the  corre¬ 
lation  peak.  An  experimental  evaluation  will  be  presented  at  the  conference. 
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Abdnet 

We  introduce  the  notion  of  apace- variant  coordinate  transformations,  conaider  aome  of 
their  applications  in  computer  science  and  show  hour  they  can  be  implemented  optically. 


1  Definition  of  Space- Variant  Coordinate  Transformations 

The  optical  implementation  of  coordinate  transformations  baa  received  much  attention  in  optical  literature 
since  the  publication  of  O.  Bryngdahl’a  rlaaairal  paper  in  1974  [B]. 

However,  both  O.  Bryngdahl  and  all  aubaequcnt  researchers  have  been  concerned  with  optical  imple¬ 
mentation  of  coordinate  transformations 

=  /(*,*); 

l®  =  #(*.p), 

such  that  both  /  and  g  are  eipreesed  in  elementary  functions  and  these  expressions  do  not  vary  across 
the  plane.  We  cell  each  coordinate  transformations  "Space- Invariant”. 

For  example,  the  polar  mapping 

fu=  y/*i  +  y*; 

l*  =  tau"1(f) 

is  a  space- in  variant  coordinate  transformation  because  all  points  on  the  plane  are  transformed  due  to  the 

(«,p)  — (>/£*+?,  tan-‘(J)), 

whereas  the  following  coordinate  transformation 

{u wains;  v  =  cosy,  if**  +  p*<9; 

«***;  v  =  lap,  if*  >4; 
u  =  y/r,  vse*,  if  p  >  4lt*  <  4; 
a  =  0;  e  s  0,  otherwise 

is  space- variant  because  it  cannot  be  represented  in  the  form 

/  «  =  /(*.  p); 

l  «  =  »(*.»). 

where  /  and  g  are  elementary  functions  invariant  across  the  plane. 

A  space-variant  coordinate  transformation 

/«  =  /<(*, P);  *wff(c,p),  if  (*,p)€  Di, 

\  n  *  0;  *  *  0,  other  wire 

is  cafiad  adaptive  if  domataa  A  am  aot  knows  ia  advance  and  are  determined  in  the  course  of  the  solution 
of  a  problem. 

Adaptive  space- variant  coordinate  transformations  play  an  important  role  in  the  various  problems 
of  computer  science  as  a  whole,  sad  padteta  recognition,  in  particular.  One  of  these  applications  is 
eoanidsred  hr  the  section  that  follows. 
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2  An  Application 

Consider  the  problem  of  recognising  all  cross-roads  in  a  photograph  of  the  land,  such  that  the  angle 
between  roads  ia  exactly  v/2. 

With  the  help  of  space-variant  coordinate  transformations,  this  can  be  done  as  follows. 

Step  1.  Let  Ii  be  the  image  of  a  photograph  and  let  {(*t.Hk)}<!Li  he  the  coordinates  of  cross-roads 
which  are  easily  recognised  with  the  help  of  other  techniques.  Apply  to  this  image  the  following  space- 
variant  coordinate  transformation 

/  «  m  (s  -  Mi),  0  =  (at  -  g),  if  y/(*  -*<)*  +  (»-*)*  <  r; 

\  s  =  0,  v  =  0  otherwise, 

where  r  is  a  sufficiently  —wall  radius  (significantly  leas  than  the  minimum  distance  between  points 
{(*.*)}£i  which  can  easily  be  computed  optically  in  constant  time  using  the  algorithm  described 
in  [KS].) 

As  a  result,  we  obtain  a  new  image  /j  which  concidee  with  Ii  only  in  the  vicinity  of  those  cross-roads 
which  have  angle  x/2  because  the  above  coordinate  transformation  rotates  each  cross-road  by  s/2  with 
respect  to  the  corresponding  (*,,!*)  and,  hence,  only  those  cross-roads  which  have  angle  s/2  remain 
invariant  after  the  rotation. 

Step  2.  Having  now  subtracted  from  ft,  we  obtain  a  new  image  /*  =  7i  \  /j  containing  only  those 
cross-roads  whose  angles  ate  not  equal  to  s/2. 

Step  3.  Compute  U  =  h\  h-  Obviously,  image  I4  contains  the  desired  cross-roads  only. 


3  Optical  Implementation 


Let  Hin9mr(x,  v)  be  the  transmittance  distribution  of  a  CGH  for  space-invariant  coordinate  transformation 

/«  =  /(*,»); 

\  *  =  #(*.*)• 

Then  space-variant  coordinate  transformation 

/«•  =  /(*-*<.»-*);  «  =  »(*-*<,»-#),  if  y/{x  -  *i)J  +  (»-  a)*  <  r; 

l «  as  0;  v  =  0,  otherwise, 

which  is  adaptive  to  points  {(**,  a))i!i  can  easily  be  implemented  optically  using  CGH  whose  transmit¬ 
tance  distribution  is  described  as  follows: 


•si 


where 


Thus,  the  CGH  for  an  adaptive  space- variant  coordinate  transformation  can  be  obtained  on  a  photo- 
addressed  SLM  ia  constant  time  from  the  CGH  for  the  corresponding  space-invariant  transformation 
(which  is  assumed  to  be  known)  using  an  optical  convolver.  Hence,  the  transformation  itself  can  also  be 
performed  optically  in  constant  time. 
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Abstract 

We  present  an  optical  algorithm  for  extracting  regions  of  minimum/maximum  intensity 
from  an  image.  Time-complexity  of  the  algorithm  is  0(1). 


1  Introduction 

Extracting  regions  of  maximum/minimum  intensity  from  an  image,  as  well  as  thresholding  an  image  at 
a  given  level  of  intensity,  are  important  operations  in  image  processing  [NR], 

Even  though  thresholding  has  received  considerable  attention  from  the  optical  engineering  com¬ 
munity  [WT,  SMLBTSSC],  little  work  has  been  done  on  the  optical  implementation  of  the  maxi¬ 
mum /minimum  operation.  The  only  exception  is  [GCHHZY]  which,  however,  deals  with  the  computing 
maxi  mum /minimum  entry  of  a  1-dimensional  array  of  light  beams  rather  than  a  2-dimensional  image. 

The  point  is  that  extracting  the  above  regions  cannot  be  performed  in  a  purely  engineering  manner 
(as  is  the  case  for  thresholding),  but  requires  a  somewhat  algorithmic  approach. 

In  this  paper  we  present  an  optical  algorithm  which  extracts  regions  of  maximum/minimum  intensity 
from  an  image  and  does  it  in  constant  time. 

2  The  Zero-Crossing  Approach  and  Its  Drawbacks 

It  may  appear  that  the  simplest  way  to  solve  the  above  problem  is  to  make  use  of  spatial  differentiation 
of  an  image  that  can  be  readily  implemented  optically.  Indeed, 

V)  dl{x,y) 

‘  ~sr  „ — -  °' 

if  /(xmax.ftnu)  =  ma xl(x,y),  where  I(x,y)  is  the  intensity  of  an  image  at  the  point  (x,y).  Hence, 

(*.») 

computing  (xmu,lfa>u)  can  be  performed  optically  by  determining  regions  of  sero  intensity  of  the  differ¬ 
entiated  image. 

However,  this  approach  does  not  allow  one  to  distinguish  between  minimum  intensity  and  maximum 
intensity,  or  between  local  extremum  and  global  extremum,  not  to  mention  its  sensitivity  to  noise  in  the 
input  data. 

To  avoid  these  drawbacks,  we  propose  an  optical  algorithm  for  computing  minimum /maximum  of  the 
bivariate  function  I(x,  y)  using  modulation  (or  frequency  modulation). 
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3  0- modulation  approach  to  maxima  operation 

It  is  well  known  that  $— modulation  of  an  image  followed  by  a  Fourier  transform,  maps  a  point  (*,  y) 
of  the  image  to  the  point  (cos  /(*,  y),  sin  J(x,  y)),  where  I(x,y)  is  intensity  of  the  image  at  (x,y)  [AL]. 
Hence,  regions  of  minimum  intensity  are  mapped  to  the  point  closest  to  the  X-axis,  whereas  regions  of 
maximum  intensity  are  mapped  to  the  point  furthest  from  the  X-axis. 

These  points  can  be  extracted  optically  in  constant  time  as  follows: 

Step  1.  Project  points  {(u,  ti)|u  =  cos  I(x,  y),  v  =  sin  I(x,  y)}  obtained  as  a  result  of  the  ^-modulation 
of  the  intensity  distribution  7(z,  y)  to  the  Y-axis.  Obviously,  this  can  be  readily  done  with  the  help  of 
CGH  which  performs  the  coordinate  transformation 

/«  =  0; 

=  y. 

As  a  result  we  obtain  a  set  of  points  P  =  {(0,sin/(x,y))}. 

Step  2.  Compute  the  vector  sum  P  -f  Ray  of  the  points  P  with  the  Ray  emanating  from  the  origin  of 
coordinates  upward  along  the  Y-axis  (obviously,  it  can  be  done  optically  in  constant  time  with  the  help 
of  an  optical  convolver.) 

As  a  result,  we  obtain  an  image  where  the  point  (0,sin(min/(x,y)))  has  unit  intensity  whereas  other 

(*.») 

points  have  higher  intensity.  Hence  this  point  can  be  extracted  by  thresholding  as  follows: 

Step  3.  Threshold  the  image  obtained  at  Step  2  at  the  unit  level  of  intensity  and  extract  thereby  the 

point  (0,sin(min  I(x,y))). 

(*.») 

Step  4.  Demodulate  the  point  (0,sin(min/(x,y)))  and  obtain  thereby  the  region  of  minimum  intensity 
of  the  input  image. 

An  algorithm  for  extracting  the  region  of  maximum  intensity  of  an  image  is  the  same  up  to  substituting 
the  vector  sum  P  +  Ray  at  Step  2  by  the  vector  difference  P  -  Ray. 
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Abstract:  We  show  the  possibility  of  implementing  the  LIFE  game  by  means  of  an  optical  correlator. 
Since  this  cellular  automaton  is  an  universal  computer  there  is  no  inherent  limitation  in  the 
information  processing  capabilities  of  correlators. 

Optical  correlators  are  probably  the  most  popular  and  well -studied  devices  used  in  optical 
pattern  recognition.  They  are  applied  to  a  wide  variety  of  problems  including  those  requiring  multiple 
invariances.  However,  there  is  no  guarantee  that  a  particular  problem  can  be  solved  by  means  of  an 
optical  correlator,  and  therefore  it  is  a  legitimate  question  whether  they  can  be  considered  as  general 
purpose  classifier  devices. 

In  this  communication  we  show  that  it  is  possible  to  implement  the  game  of  LIFE  by  means 
of  correlation  and  a  simple  nonlinear  postprocessing  of  the  resulting  correlation  plane.  This  procedure 
is  very  similar  to  that  used  in  optical  pattern  recognition,  although  the  nonlinear  function  is  slightly 
different  (we  need  a  rectangle  function  instead  of  the  threshold  function). 

Cellular  automata  are  usually  described  as  a  regular  infinite  array  of  cells  that  can  take  one 
of  several  allowed  states.  The  automata  evolve  in  discrete  time  steps  (generations)  governed  by 
deterministic  rules.  The  state  of  a  given  cell  in  the  generation  k+ 1  is  a  function  of  its  previous  state 
(that  at  generation  k)  as  well  as  of  the  states  of  its  neighbors  (also  at  generation  k).  The  updating  of 
the  cells  is  performed  synchronously,  that  is  all  at  a  time.  Some  of  these  cellular  machines  exhibit  a 
behavior  complex  enough  to  support  universal  computation1.  They  are  called  class-IV  automata  and 
LIFE  is  one  of  them3.  The  possibility  of  being  implemented  by  correlation  leads  to  the  conclusion  that 
a  correlator  is  as  powerful,  in  its  information  processing  capabilities,  as  the  most  powerful  machines 
known  (or  for  those  who  accept  the  Church-Turing  thesis  as  the  most  powerful  machines  that  can 
exist),  namely  universal  computers. 

The  cells  of  LIFE  can  be  in  one  of  foe  two  possible  states,  dead  or  alive.  The  transitions 
between  these  two  states  are  controlled  by  foe  states  of  its  eight  neighboring  cells  following  these 
basic  rules: 

-  A  cell  that  is  alive  at  time  k  continues  living  at  time  k-h  1  if  and  only  if  it  has  2  or  3  living 
neighbors. 

-  A  cell  that  is  dead  at  time  k  becomes  alive  at  time  k+1  if  it  has  exactly  three  living 
neighbors. 

From  these  rules  it  is  evident  that  foe  state  of  any  cell  is  determined  by  foe  total  sum 
(assuming  one  for  live  cells  and  zero  for  foe  dead  ones)  of  foe  values  of  its  neighbors,  and  not 
depends  on  their  individual  values.  This  property  is  called  total istic  and  enables  foe  optical 
implementation.  By  correlating  an  input  LIFE  pattern  with  foe  3x3  filter: 

1  1  1 
1  0.5  1 
.1  1  1. 

and  processing  foe  output  intensity  distribution  by  foe  function  f(x)  =rect  (x/10  -  1)  we  obtain  foe 
next  generation. 

As  an  example  of  foe  computing  capabilities  of  foe  automaton  we  show  foe  construction  of 
an  AND  gate.  The  correlation  was  simulated  by  using  foe  Fast  Fourier  Transform  (FFT)  algorithm 
over  an  array  of  128x128  cells.  The  input  pattern  is  that  shown  in  Fig.  1  where  foe  two  inputs  are 
labelled  by  A  and  B.  The  signals  are  carried  by  patterns  formed  by  five  cells  called  gliders  (labelled 


3, 
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with  ones)  which  move  one  position  diagonally  each  four  generations.  The  presence  or  absence  of  a 
glider  represents  the  bit  1  or  0  respectively.  The  structure  at  the  bottom  left  comer  is  a  glider  gun, 
a  periodic  pattern  that  emits  a  glider  after  30  generations.  With  the  proper  timing  and  alignment  foe 
collision  (indicated  by  Q  of  two  gliders  produces  a  vanishing  reaction  which  annihilates  both  patterns, 
as  shown  in  Fig.  2.  The  logic  operation  is  based  on  these  collisions.  When  there  is  a  glider  in  both 
channels  (A  and  B)  foe  collision  of  one  of  them  opens  a  hole  in  foe  stream  produced  by  foe  glider 
gun,  which  is  profited  by  foe  second  to  pass  through  (Fig.  3).  When  there  is  only  one  glider  it  is 
eliminated  by  foe  stream  (Fig.  3)  and  therefore  by  detecting  foe  presence  or  absence  of  gliders  at  foe 
position  indicated  in  Fig.  4  at  predetermined  times  we  obtain  foe  output  of  an  AND  gate.  The  rest 
of  foe  stream  produced  by  foe  glider  gun  is  eliminated  by  foe  eater  ( foe  collision  C  shown  in  Fig. 
4). 


Figure  1.  Initial  pattern.  Figure  2.  The  first  glider  of  channel  A 


opens  a  hole  in  foe  stream. 


Figure  3.  The  first  glider  of  channel  B  pass  Figure  4.  Final  result 

while  foe  solitary  second  glider  of  channel  B 
is  destroyed. 
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Abstract  The  use  of  Minimum  Average  Correlation  Energy  Filters  for  the  recognition  of  defocused 
images  is  studied.  We  propose  to  adapt  the  filter  simultaneously  to  the  original  and  to  the  blurred 
images  to  improve  the  recognition  process.  In  this  way  the  value  of  the  correlation  peak  is  stable  for 
a  wide  range  of  blurring  in  the  input  image. 

The  input  image  in  an  optical  correlator  may  be  degraded  by  different  sources  of  blurring.  A 
common  degradation  is  defocus.  The  theoretical  performances  of  several  correlation  filters  have  been 
previously  studied[1].  The  filter  was  designed  from  the  original  image  (non  defocused).  As  expected, 
the  autocorrelation  peak  decreases  as  the  blurring  increases,  and  the  discrimination  capabilty 
decreases  similarly.  In  Ref.  1  the  study  was  done  with  the  Classical  Matched  Filter,  Phase  Only  Filter, 
and  Inverse  Filter.  Recently  a  similar  study  was  carried  out  for  a  joint  transform  architecture^].  One 
possibiSty  to  improve  the  performance  of  the  correlation  method  is  the  use  of  Minimum  Average 
Correlation  designs  as  the  MACE  and  M INACE  filters[3].  With  these  designs  it  is  possible  to  control  the 
correlation  at  the  origin  with  different  input  objects  (called  the  training  set).  This  allows  to  include  the 
blurred  images  as  part  of  this  training  set  and  therefore  to  obtain  defocusing  invariant  filters.  We  show 
that  this  is  a  useful  technique  to  improve  the  performance  of  a  MACE  or  MINACE  filter  for  a  wide  range 
of  defocused  scenes  with  as  few  as  one  blurred  image  for  each  target  pattern. 

Figure  1  shows  the  original  object  to  be  detected  and  two  blurred  versions.  We  show  also  the 
two  other  objects  to  be  rejected.  The  butterflies  are  54x64  pixels  in  size  zero-padded  to  a  total  of 
128x128  pixels.  The  point  spread  functions  of  the  blurring  are  approximated  by  circles  of  diameter 
d«1(non  defocused),  4.16,  and  20.8  pixels  respectively. 


Figure  1 

Figure  2,  and  3  represent  the  values  of  the  correlation  peaks  obtained  with  different  filter 
designs  as  a  function  of  the  defocus.  Fig.2  corresponds  to  MACE  filters  and  Fig.3  corresponds  to 
MINACE  filters.  In  both  figures,  fines  A  correspond  to  a  filter  designed  to  give  a  central  correlation  value 
1  with  the  non-defocused  target  and  0  with  the  two  non-targets.  A1  is  the  autocorrelation  fine  and  A2, 
and  A3  to  the  crosscorrelations  peaks.  Although  the  discrimination  capabilty  is  good,  we  see  that  the 
correlation  value  decreases  rapidly  as  the  defocus  increases,  being  the  MACE  less  resistant  than 
MINACE.  . 

This  lack  of  resistance  to  defocusing  in  the  MACE  design  when  no  blurred  images  are  included 
in  the  training  set  is  due  to  the  high  values  of  the  filter  components  corresponding  to  frequencies  of  low 
energetic  content  in  the  training  images.  Since  it  can  be  assumed  with  wide  genera Rty  that  these  low- 
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energy  components  are  those  of  high  frequency,  the  MACE  fitter  win  be  very  sensitive  to  even  smal 
changes  in  the  high  frequency  components  of  the  input  images.  On  the  other  hand,  the  defocus  of  an 
image  mainly  affects  to  its  high  frequency  components  (the  effect  can  be  approximated  to  a  low-pass 
filter)  being  the  error  largely  ampMed  and  thus  producing  significative  variations  in  the  correlation  peaks. 
The  MINACE  filter,  athough  designed  with  the  noise  resistance  in  mind,  performs  better  due  to  the 
Imitation  imposed  to  the  maximum  value  of  a  filter  component  corresponding  to  a  low-energy  frequency. 
A  better  understanding  of  this  behavior  may  be  used  to  design  more  robust  filters  in  the  future. 

Since  the  MACE  and  MINACE  filter  design  methods  alow  multipie  target  correlation  conditions, 
we  can  use  a  defocused  object  as  one  of  the  targets  and  we  can  impose  that  the  central  correlation 
value  with  the  non-defocused  and  with  the  defocused  target  to  be  the  same.  Curves  labeled  B 
correspond  to  the  use  a  defocused  (diameter*4.16)  object  together  with  the  non-defocused  one.  B1 
corresponds  to  the  autocorrelation  and  B2,  and  B3  to  the  crosscorrelation.  Curves  C  show  simitar 
results  when  using  the  most  defocused  object  (diameter*  20.8)  in  the  filter  design.  Of  course  the 
autocorrelation  values  with  the  objects  included  in  the  design  are  the  same.  The  interesting  feature  is 
that  when  the  most  defocused  image  is  used  in  the  filter  design,  the  autocorrelation  values  for 
intermediate  values  of  defocusing  also  take  almost  the  same  value.  The  discrimination  capabiity  is  high 
and  the  same  for  all  defocusings,  and  the  absolute  intensity  of  the  autocorrelation  peak  is  still  quite 
high. 


MACE  FILTER  MINACE  FILTER 


Figure  2  Figure  3 
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Abstract 

An  optical  system  for  the  Haar  wavelet  transform  of  binary  images  based  on  a  time- 
division-multiplcxing  technique  is  described.  We  show  that  a  simple  image  differentiator 
can  handle  the  bipolar  wavelet  function  efficiently  in  the  system. 

Recently,  various  optical  architectures  of  the  wavelet  transform  v  ,  ,.<»ve  been  investi¬ 
gated  to  get  fast  transform  results  by  taking  advantage  of  parallelism  of  optics1.  Following 
Yang  et  al .2,  we  performed  experiments  on  the  optical  Haar  WT  using  a  shadow  casting 
system.  To  handle  bipolar  nature  of  the  wavelet  functions,  Yang  et  al.  used  a  polarization 
encoding  technique  in  their  shadow  casting  system  with  a  coherent  light  source.  Through 
a  properly  rotated  analyzer,  the  phase  difference  between  the  lights  from  the  positive  and 
negative  parts  of  the  wavelet  function  becomes  x,  by  which  the  amplitude  subtraction  is 
obtained  in  their  experiment.  However,  we  could  not  obtain  good  amplitude  subtraction 
results  because  the  sum  of  two  coherent  beams  with  a  phase  difference  x  still  interfered 
constructively  at  higher-order  positions  in  the  output  focal-plane. 

To  represent  bipolar  images  in  incoherent  systems,  area  encoding,  wavelength  multi¬ 
plexing,  and  time  division  multiplexing  (TDM)  techniques  have  been  commonly  used3-5. 
In  these  schemes,  an  additional  subtraction  process  is  required  after  the  two  outputs  of 
different  signs  are  obtained.  Here  we  adopt  the  TDM  technique  and  show  that  a  simple 
parallel  image  differentiator®  can  be  used  efficiently  for  the  subtraction. 

The  Haar  WT  of  a  2-D  image  s(x,  y)  is  mathematically  described  by 

where  h(x,y)  is  the  mother  Haar  wavelet  function,  a  and  b  are  dilation  factors,  and  p  and 
q  are  shifting  factors.  Let  us  denote  the  positive  and  negative  parts  of  the  wavelet  function 
b+{x^y)  =  rect(x— 0.5,  y— 0.5)  +  rect(x+0.5,y+0.5)  and  h~(x,y )  =  —  rcct(x+0.5,y— 0.5) 
—  rect(x  —  0.5,  y  +  0.5),  respectively.  Thus  h  =  h+  +  h~.  The  Haar  WT  by  TDM  means 
that  h+  and  |A~|  are  presented  alternately  during  2 jt0  <  t  <  (2 j  +  l)f0  and  (2 j  -f  l)f0  < 
t  <  (2 j  +  2)f„,  respectively,  j  =  0,  1,  2,  •  •  •,  and  t„  is  a  time  duration.  Thus  the  TDM 
version  of  Eq.  (1)  can  be  written  as 

Wtd\i(p,  ?,  a ,  6,  i )  =  W+(p,  q,  a,  b)  [u(f  -  2jl„)  -u(t-  (2 j  +  l)/0)] 

j 

+lV~(p,  9,  a,  b)  [u  (t  -  (2 j  +  l)Q  -u(t-  (2 j  -f  2)*,)]  (2) 
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where  W*(p,q,a,b)  =  (l/\/ab)  J  J s(x,y)|5*((x  —  p)/a,(y  —  q)/b)\dxdy  and  u(t)  is  the 
Heaviside  step  function.  Note  that  Wtdm  is  always  unipolar  if  s(x,y)  is  a  unipolar 
function.  To  obtain  W,  W+  —  W~  should  be  executed.  We  can  get  this  subtraction  by 
differentiating  Wtdm  of  Eq.  (2),  i.e., 

=  W^p,q,a,b)6(l)  +  f^(-l)k[YV^(p,q,a,b)-  W-(p,q,a,b))6(t- kla)  (3) 
01  k= 1 

where  6(t)  is  the  Dirac  delta  function.  Practically,  k  =  1,  2  is  enough.  Eq.  (3)  implies  we 
can  get  W+  —  W~  by  detecting  the  pulse  amplitude  after  t  >  0. 

Our  system  that  implements  Eq.  (2)  and  (3)  is  shown  in  Fig.  1.  The  first  part 
that  calculates  Wtdm  is  a  shadow  casting  system  whose  structure  is  similar  to  that  in 
Ref.  2.  The  patterns  for  h+  and  |A“|,  which  are  flipping  alternately,  are  represented 
by  the  light  transmittance  of  a  spatial  light  modulator  (SLM1).  A  sequence  of  discrete 
shifting  factors  (pm,gn)  is  represented  by  the  positions  of  the  point  sources.  Thus  for  a 
given  size  of  the  wavelet  pattern,  a  and  6,  WTDM{Pm,qn,t)  is  detected  and  fed  to  SLM2 
in  the  image  differentiator®  (or  novelty  filter7).  Both  the  two-beam  coupling  and  four- 
wave  mixing  differentiators6,7  detect  only  the  magnitude  of  |H/(pm,^n)|,  because  they 
produce  only  unipolar  light  pulses  whenever  Wtdm  changes.  To  detect  both  the  sign  and 
magnitude  of  W,  we  used  a  beam-fanning  differentiator6,8,  which  produces  positive  light 
pulses  only  when  Wtdm  increases.  If  the  pulses  are  generated  at  t  =  kta  of  even  (odd)  k, 
W  is  positive  (negative).  The  photorefractive  crystal  (BaTiOa)  is  placed  near  the  image 
plane  of  Wtdm  so  that  the  beam  fanning  can  occur  independently  for  every  beam  spot 
of  (pm,9n)-  An  experimental  output  of  the  beam-fanning  differentiator  when  W+(pm,qn) 
^  0  and  W~(pm,qn)  —  0  is  shown  in  Fig.  2. 
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Generalized  Fourier  transforms  of  integer,  fractional  and  imaginary  degrees  cm  be 
implemented  by  lenses.  Optical  units  which  realize  such  basic  transformations  are  shown  to 
be  useful  in  the  implementation  of  more  complex  optical  systems  such  as  imaging  systems 
and  correlators. 


Very  recently,  Lohmann1  has  drown  that  generalized  Fourier  transforms  (GFT's),  defined  on  the  basis  of 
Wigner  functions,  can  be  optically  implemented  by  lenses,  after  generalizing  the  usual  geometries  used  far 
the  implementation  of  the  conventional  Fourier  transform.  Figures  1(a)  and  1(b)  show  those  geometries.  For 
the  configuration  shown  in  figure  1(a),  the  GTF  of  order  p,  fCp),  (p>0),  cm  be  written  in  the  form: 


F.,.y,  l°(x-y)}asJj0(*.y)e  ^  e  dxdy 


0) 


where 


sind  =  jVzi(2f~zi)  ; 


(2) 


and  +=p*/2.  When  collimated  illumination  is  used: 


Zj  =  2^  =  f(l-COsp) 


(3) 


For  the  configuration  of  figure  1(b),  the  expression  of  fCp)  takes  the  form: 

F«,.y,W{°(x'y)}  =  J/°(x*y) r>(“'  yy,>dxdy 

and  d  =  f(l-oosd) 


(4) 

(5) 


The  inverse  GFT's  F^'P)  can  be  obtained  from  equations  (1)  and  (4)  after  substituting  there  j  by  -i.  The 
implementation  of  inverse  GFT's  is  also  realized  with  the  geometries  of  figure  1,  after  substituting  there  the 
positive  lenses  by  negative  lenses  and  interchanging  the  object  and  the  transformation  planes.  The  object  mid 

the  transform  distributions  are  in  this  case  virtual.  According  to  equation  (2),  GFTs  may  have  imaginary 
degrees  for  certain  ranges  of  z\. 

An  important  parameter,  associated  to  any  GFT  F 0>),  is  the  so  called  standard  focal  length,  fj.  The 
expressions  of  fj,  for  the  angle-lens  configuration  and  for  the  two- lenses  configuration,  are  respectively: 

ft-f**;  f,  =  f^ 


•  .  •  .  .  •  . . 


(6) 
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Tam torn*  with  the  same  paw obi  fjl  belong  to  the  mac  family,  meaning  that,  when  they  an 

tnytemcnaed  and  caacadod  in  tandem.  they  satisfy  the  properties  of  the  C3T'»  expressed  by  the  relatioRthip: 

Fw{F»{f(n.y)}}-F«{F‘->{f(i.y)}}-F^{f(x.y)}  & 

Now,  the  aaeaniag  of  standard  focal  length  fj  becomes  dear.  It  represents  the  scale  of  fO),  resulting  from 
the  application,  ia  tandem,  of  other  GPT's  belonging  to  the  same  femily. 

Exact  imaging  optical  systems  and  correlators,  showing  no  phase  curvatures  in  the  planes  of  interest,  can  be 
realized  by  cascading  basic  units  which  implement  fractional  Fourier  transforms  of  the  same  family. 
Particularly,  F^a+P)  may  represent  a  conventional  Fourier  transform  of  an  object  distribution  if  a+{$*l+4n  (n 
is  an  integer),  an  inverted  image  if  a+{J=2+4n,  or  an  erect  image  if  a+(J=4n.  Figure  2  shows  two  examples  of 
imaging  systems  based  an  fractional  GFT's  of  the  same  family.  The  single  lens  configuration  units  are  used 
in  figure  2(a)  with  lenses  with  the  same  focal  length,  and  two-lenses  configuration  in  figure  2(b).  In  this  last 
case,  it  happens  that  the  exact  conventional  Fourier  transform  F^1)  is  formed  in  plane  it;  the  system  is 
therefore  a  correlator  of  focal  distance  f,  =  f/V3. 
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Figure  1(a)  Single-lens  configuration. 


Figure  2(a)  Exact  imaging  system. 
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Figure  1(b)  Two-lenses  configuration 


[1]  A.  W.  Lehman,  'Image  rotation  Wigner  rotation  and  the  fractional  Fourier  transform ",  J.  Opt  Soc.  Am. 
A.  10,2181-2186(1993). 
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Abstract : 

In  the  paper,  a  new  architecture  of  optical  wavelet  transform 
(OUT)  system  with  a  lenslet  array  is  proposed,  and  its  optical 
performance  and  optical  limits  are  analyzed. 


1.  Optical  Wavelet  Transform  system 
1.1  Wavelets  Arrangement 

For  the  fixed  dilation  factor  (a<,at),  wavelets  can  be  represented 
ky  h*ylk: 


ixyik 


(x-xj)  (y-yk)  j 


(D 


It  can  be  considered  as  a  4-D  matrix,  and  the  wavelet  matrix (WM) 
can  be  arranged  with  a  2-D  array  (indexed  by  i,k  )  of  2-D 
submatries  (indexed  by  x,y).  These  wavelets  are  displayed  on  a  high 
resolution  color  video  monitor  or  a  LCTV  addressed  by  a  CRT  monitor 
according  to  the  space  arrangement (Fig. l) . 

1.2  lenslet  array 

An  optical  device  to  provide  optical  connection  between  input 
image  and  wavelets  displayed  on  video  monitor  is  needed  in  OWT.  We 
have  designed  an  array  of  lenslet  with  N*N  lenses,  and  each  lens 
images  a  specific  wavelet  submatrix  onto  the  input  image  which  is 
a  SIM  with  N*N  binary  pixels,  thus  the  submatrix  of  wavelets  can  be 
superposed  to  the  input  image,  and  the  transmitted  beam  from  the 
SIM  are  collected  onto  the  output  plane  to  form  a  N*N  pixel  array 
which  represents  N*N  products  of  the  2-D  wavelet  submatries  with  a 
2-D  processed  image  (Fig. 2  and "fig* 3) .  In  the  output  plane,  each  of 
pixel  indexed  by  the  translation  factor  (x(,yj)  corresponds  to  a  WT 
of  the  input  image  under  one  wavelet  indexed  by  the  same 
translation  factor  and  a  specific  dilation  factor. 

For  the  wavelets  with  both  positive  and  negative  values,  we  need 
a  subtraction  implemented  by  computer  to  obtain  optical  results. 

2.  Optical  limits  of  the  OWT  system 

Since  the  OWT  system  is  designed  on  the  basis  of  geometrical 
optics,  the  diffraction  and  aberration  of  lenslet  array  must  be 
considered  when  the  SLM s  with  large  space-bandwidth  product (SBW) 
are  used  to  be  represented  the  wavelet  mask  and  input  pattern  in 
the  optical  system;  and  SBW  of  WM  affects  the  number  of  lens  in  the 
lenslet  array  which  is  the  one  of  the  most  valuable  factors  for 
evaluating  performance  of  the  OWT  system. 

In  Ref. 5,  Dr.  Sakano  et  al.  analyzed  optical  limits  for  one  kind 
of  lenslet  array  used  for  optical  interconnections,  the  general 
method  can  also  be  used  in  describing  the  limits  of  our  OWT  system. 
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Because  off-axis  state  lenses  are  used.  In  this  case  astigmatic 
aberration  is  dominant.  Thus  we  analyze  the  astigmatic  aberration; 
on  the  other  hand,  the  diffraction  limits  have  to  be  considered. 

Let  A  and  0  represent  the  wavelength  of  light  and  the  diameter  of 
lens,  denotes  the  upper  limit  of  the  number  of  lens  in  lenslet 
array,  and  n  denot  es  the  refractive  index  of  the  lens.  The 
following  condition  is  satisfied: 

*  0.468(-£)^ 


N0t^O  .  468  J  3 


3n+l 
4  n 


The  condition  implies  that  N#f  is  determined  by  geometric  factors 

of  lenses. 
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The  method  is  suggested  of  conjugated  wave  formation  with 
controlled  contrast  inversion  of  images  of  2-D  binary  optical 
matrices.  The  optimum  conditions  for  realization  of  phase 
filtration  of  object  field  are  established. 

By  means  of  computer  simulation  methods  the  peculiarities  are 
considered  of  light  field  transformation  at  phase  filtering  of  Fourier 
spectrum  in  the  scheme  of  DFWM  wavefront  conjugation. 

Phase  conjugation  mirror  constitutes  a  key  element  of  optical  setup 
under  consideration  [l]  which  consists  of  a  nonlinear  layer  as 
photosensitive  matter  together  with  count erpropagating  pump  beams  and  a 
phase  filter  just  behind  the  layer.  Phase  filter  is  inserted  into  one  of  the 
pump  beams  and  there  is  a  small  central  zone  on  it  which  shifts  an 
appropriate  pump  field  by  some  arbitrary  phase  with  respect  to  remainder 
parts  of  the  field.  An  object  beam  is  formed  by  plane  wave  illumination  of 
amplitude  transparency.  A  resulting  beam  comes  through  a  positive  lens  which 
is  set  at  an  arbitrary  distance  from  the  transparency  and  then  is  focused  on 
the  nonlinear  layer  placed  at  the  focal  plane  of  the  lens. 

At  the  moment  of  three  beams  action  a  conjugated  beam  is  generated  by 
DFWM  which  propagates  into  opposite  direction.  The  resulting  field  in  the 
plane  just  before  transparency  is  described  by  the  expression 

E  Cp)  m  (\f)  2  t(p  )*(p  )  ejqp{-ikf~2  (p-p  )p  }d*p  d*p  .  (1) 

C  JJ  T  *  twht 

Function  tip)-  t+T<p),  where  r(p)  is  a  variable  part  relative  to  the 
0 

average  value,  ^escribes  the  amplitude  transmittance  of  the  transparency; 

*(p) «*  + (1-*  )JI(p) ,  where  ♦  is  phase  delay  exp (ip),  describes  the  n-form 
0  0  0 

phase  filter;  f  is  the  focal  length  of  the  lens;  k  »  2ir/X  is  the  wave 
number . 

If  <p  -  n  (sign  of  the  field  amplitude  is  inverted  in  a  limited  region 
near  the  optical  system  axis) ,  the  expression  (l)  describes  contrast 
inverted  image 

E  (p)  «  t  -r (p) +S(p) .  (2) 

C  0 

S(p) -2 [t (p) *3{n(p) } ]  is  an  additional  term  ,  which  depends  on  the  filter  as 
well  as  on  the  object  configuration.  An  efficiency  of  transformation  is 
depicted  in  Fig.l.  For  values  of  less  than  0,25  a  contrast  inversion  is 

not  obtained  because  it  is  an  intensity  (not  amplitude)  of  inverted  input 
matrix  that  is  recorded  in  the  trasparency  plane. 

Under  certain  conditions  when  Fourier  plane  of  the  lens  doesn't 
coincide  with  the  PCM  plane  a  partial  contrast  inversion  of  object  image  may 
be  obtained.  In  this  case  conjugated  field  in  the  object  plane  is  described 
by  the  expression 
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S  (p)  •Const%{  I  ]g(p-p  ,d)$(p  )  G  Cp  -p  ,  d)  5  { ttp  )  }dJp  d  2~p  } . 

C  ||  ^  **  ^  H  PWF 

where  G(p  -p^.d)  -  (iXd)  exp{  (iJc/2d)  |p  -p  )  }  is  Fresnel  transmittance  of 

free  space,  d  -  distance  between  PCM  and  Pourier  plane  of  the  lens,  symbols 
C,  N  and  F  mark  the  planes  of  image,  phase  conjugating  mirror  and  phase 
filter  respectively. 


Original 

matrix 


Contrast 

inverted 

image 


.95 


.75  .50  .25 


.05 


Fig. 1 . 


Efficiency  of  contrast  inversion  transformation  vs 
spatially  averaged  value  of  input  matrix  transmittance  tQ . 
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Fig. 2.  Partial  contrast  inversion  of  the  conjugated  image. 

Calculation  of  field  in  the  object  plane  (Fig. 2)  has  been  carried  out 
using  the  model  of  NxN  matrix  of  2-D  Gaussian  profiles  with  the  following 
values  of  parameters:  N=15,  f»25  cm,  d=»l0  cm,  halfwidth  of  the  filtering 
zone  b=250  pm,  halfwidth  of  Gaussian  function  a«50  pm,  period  -  h«150  pm.  As 
it  is  seen  from  Fig. 2,  the  inverted  image  in  the  initial  matrix  image  is 
only  partially  saved.  Dimensions  of  the  inverted  region  depend  upon  the 
distance  between  PCM  and  Fourier  plane  of  the  lens,  whereas  its  position  on 
the  image  depends  on  the  transverse  filter  shift.  There  is  some  transitory 
annular  zone  in  Fig. 2  which  corresponds  to  gaussian  function  of  the  phase 
filter. 

The  method  of  contrast  inversion  in  conjugated  wave  corresponds  to 
logical  NOT  operation  in  binary  matrices  of  optical  signals  processing.  By 
the  same  way  using  four-wave  mixing  for  realization  of  light  fields 
multiplication  together  with  phase  filtering  gives  the  opportunity  to  form 
conjugated  fields,  which  to  the  action  of  logical  operator  AND. 

l.v.N.Ben,  E.V.Ivakin,  A.S.Rubanov.  "Contrast  inversion  of  DFWM  conjugated 
wave  by  phase  modulation  of  the  pump"  Tech. Digest  of  Conf.  on  Wavefront 
Conjugation,  Minsk,  1987,  pp.  131-138  (In  Russian) . 
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Abstract: 

The  frequency  response  of  Low  Resolution  Fresnel  Encoded  Lenses  is  calculated  in  terms  of  all 
parameters  that  characterizes  these  lenses  and  possible  applications  to  pattern  recognition  are 
discussed. 

1.  PSF  of  Low  Resolution  Fresnel  Encoded  Lenses 
When  a  Fresnel  Lens  with  focal  length  f  for  a 
wavelength  X  is  encoded  in  a  pixelated  low  resolution  device, 
with  a  center  to  center  pixel  distance  given  by  ax,  ay,  infinite 
new  focal  regions  appear  at  the  coordinates  (kX,IY)  where  X  * 

Xf/Ax  and  Y  =  Xf/Ay  where  k,l  are  arbitrary  integers!  1 1.  A  low 
resolution  fresnel  encoded  lens  (LRFEL)  seems  an  array  of 
lenses  with  a  size  given  by  XY.  Then,  if  the  device  has  NxM 
pixels  with  a  rectangular  pupil  of  dimensions  L,  -  Nax  and  L, 

-  May,  the  number  of  lenses  appearing  in  the  device  is  given 
by  W,  -  L./X  and  Wy»L/Y.  In  Fig.1  we  can  see  a  LRFEL  with 
W,  -  Wy  -  3. 

The  amplitude  distribution  at  a  k,l  focus  for  a  plane 
wave  illumination  is  given  by  (Eq.  20  of  Refill): 

fYi)  “  TI?‘^y[sinc{T^'T^)expti17<x’iar*y,Jy,1]*r*ct('^>'‘^)  (1) 

The  sine  function  comes  from  the  Fourier  transform  of  the  rectangular  pupil  of  the  device  and  the 
reel  function  defines  the  pixel  of  dimensions  ax' Ay1.  Another  useful  parameters  are  c„«  Ax'/Ax  and 
Cy  — Ay'/Ay. 


Fig.1.  LRFEL  with  W.-W.-3. 


2.  Coherent  transfer  function  for  a  (k,l)  focus 

In  order  to  calculate  the  frequency  response  we  must  perform  the  Fourier  transform  of  Eq.  1  .Taking 
as  a  natural  unity  of  length  the  pixel  size  we  obtain: 

Hi x.i  <9ir»  9,)  K  X-  Cjy*zect  <  )  slnc{cx.gx,  cy.  gy)  (2) 

For  a  (k.1)  focus,  the  rect  function  is  shifted  k  in  the  g,  direction  and  I  in  the  gy  direction.  In  a  general 
case  (any  shape  of  the  pupil  or  the  pixel)  it  will  happen  the  same,  we  will  have  the  integral  of  the 
Fourier  transform  of  the  pixel  function  multiplied  by  the  shifted  pupil  function.  Now,  let  us  study 
some  different  cases: 

First  we  consider  (k-l-0).  In  this  case  the  cut-off  frequency  is  determined  by  the  rect 
function  at  the  frequency  (in  pixels'1)  gM  *  W,/2,  ge  y  *  Wy/2.  The  first  zero  for  W,  >  2/c,  or  Wy 
>  2/c,  of  H  is  given  by  g^,  - 1  /c,  g*,  - 1  /cy.  If  W,  <  2/c,  or  Wy  <  2/cy  then  go,  -  gM  or  g,,,,  -  gfcy. 
For  smaller  values  of  the  focal  length  H  take  negative  values  and  inversions  of  contrast  may  happen. 
For  c,->0  and  cy->0  we  obtain  H  corresponding  to  the  infinite  resolution  case  (with  a  diffraction 
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diffraction  efficiency  tending  also  to  zero).  Then  the  limitation*  imposed  for  the  low  resolution 
condition  is  a  function  of  c.,  the  ratio  between  the  pixel  size  and  the  center  to  center  pixel 
distance. 


We  can  now  make  general  considerations  for  different  kj.  When  c.W.,/2,  cyWr72  >  >  1  then 
the  rect  function  is  large  compared  with  the  sine  function  and  his  effect  will  not  be  appreciable  for 
k<W„/2-1/c,  and  l<Wy/2-1/cv.  In  that  conditions  each  focal  region  verifying  the  inequalities  wilt 
replicate  near  the  same  information.  This  situation  is  sketched  in  Fig  2  for  W„ » W, » 40  and 
c,  *  Cy » 1 .  Note  that  the  inequalities  can  be  verified  for  the  particular  case  of  a  short  focal  lens  with 
c,,Cy-1,  or  a  large  focal  length  with  c*,cy  <  <  1.  When  c,W,/2,  cyWy/2  <<  1,  then  the  rect 
function  is  thinner  than  the  sine  function,  for  k,l>0  H  has  the  aspect  of  Fig.  2. a.  The  H  for  each 
k,l  corresponds  only  to  a  little  window  of  frequencies.  In  such  case  each  focus  will  concentrate 
information  corresponding  to  a  certain  range  of  frequencies.  If  W„  Wy  -  1  then  the  windows  are 
the  ones  sketched  in  Fig  3  In  this  case  each  focus  replicates  different  information  but  all 
information  is  replicated. 


Fig.3  H  for  W.  =  40  and  c„  =  0.3  (Dashed)  and 
0.2  (Solid) 


4.  Application  to  pattern  recognition 

We  have  seen  that  in  certain  conditions  each  focus  can  replicate  different  information  of 
the  object.  The  central  focus  can  replicate  the  information  corresponding  to  a  window  of  low 
frequencies  and  the  others  ones  can  replicate  windows  corresponding  to  high  frequencies.  This  will 
permit  to  compare  separately  the  information  corresponding  to  high  and  low  frequencies  and 
different  kind  of  filters  can  be  used  in  order  to  process  appropriately  this  information. 

We  have  also  seen  that  for  short  focal  length  encoded  each  focus  replicates  near  the  same 
information.  This  opens  the  door  to  apply  different  types  of  filters  for  the  same  image  in  order  to 
improve  the  recognition  process. 
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Abstract  We  propose  a  Phase  only  filter  with  maximum  discrimination  capability. 
The  aero  modulation  state  is  introduced  to  block  certain  region  in  spatial  frequencies  which 
equalize  die  phase  difference  histogram.  The  discrimination  capability  is  increased 
significantly  compared  with  the  POF. 

The  discrimination  capability  (DC)  is  one  of  the  most  important  parameter  to  evaluate 
the  performance  of  a  filter  for  pattern  recognition.  Different  kinds  of  filters  have  been 
proposed  to  increase  this  parameter  The  Phase  Only  Filter  (POF)[l],  Optimal  Filter[2]  and 
other  techniques  [3].  In  [4]  B.V.K.V.  Kumar  and  Z.  Bahri  introduced  the  notion  of  optimal 
support  function  for  a  Phase  Only  Filter  (OPOFs)  in  the  sense  of  maximising  the  signal  to 
noise  ratio  SNR  The  support  function  indicates  which  pixels  in  the  filter  have  magnitudes 
of  1  and  which  pixels  have  zero  magnitudes.  Other  support  functions  to  optimize  different 
criteria  or  multicriteria  have  been  proposed  [5].  In  all  of  these  designs  the  support  functions 
are  calculated  taking  into  account  the  amplitude  distribution.  As  we  will  see  the  phase 
distribution  plays  a  crucial  role  in  dm  discrimination  capability  and,  in  consequence,  we 
propose  a  support  function  to  optimize  the  DC  based  on  the  phase  information. 

In  this  contribution  we  propose  a  method  that  optimizes  dm  discrimination  capability 
by  using  a  POF  with  a  support  function.  The  procedure  is  based  (Hi  modification  of  phase 
difference  histogram  by  blocking  some  frequencies.  We  investigate  the  DC  with  computer 
simulation  and  we  show  the  improvement  in  DC  obtained  with  this  method. 

We  denote  by  t(x,y)  and  d(x,y)  the  target  and  nontarget  respectively,  when  the  target 
(t(x,y»  is  present  at  the  origin,  and  the  nontarget  d(x,y)  is  placed  in  the  scene  at  coordinates 
that  the  maximum  of  the  cross-correlation  is  at  the  origin.  Let  T(u,v)  =  |T|  exp(i<f>J  and 
D(u,v)  —  J  D  |  expO^J  be  their  Fourier  transforms.  When  the  POF  is  used  the  cross- 
correlation  function  in  Fourier  domain  is  given  by: 


CD(u,v)  *  |D(u,v)  |  exp  [  i  A$(u,  v)  ] 

where  A<£(u,v)  =  <t>4( u,v)  -  <f>,(u,v)  is  the  phase  difference.  The  discrimination  capability, 
DC,  is  a  parameter  that  measures  the  ability  of  the  correlator  to  discriminate  between  two 
very  similar  input  objects,  and  it  is  defined  as: 

dc j E  lD( u' v)  l  |V| E  lr<u'  l  ^ 

Let  us  divide  the  2t  phase  interval  in  N  steps  of  width  2t/N.  Let  Sj  be  the  set  of  pixels 
(u,v)  for  which  the  phase  difference  A <t>  is  in  the  interval  A^.,  <  A 4>  £  A 4>},  where  A<t>}  = 
j.54.  Let  us  define  the  weighted  phase  difference  histogram  as: 

Pj=  V  |  D{u,  v)  | 

{u.vfeSj 


we  approximate  the  phase  difference  of  those  pixels  which  belong  to  Sj  by  A<f>j,  the  DC  can 
be  written  as: 
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The  DC  is  optimized  if  we  equalize  Pj**  P^,^,  j«l...N/2.  This  can  be  made  by  blocking 
some  frequencies. 

Figure  1  shows  die  input  scene,  the  target  is  the  letter  F,  and  the  object  to  be  rejected 
is  the  letter  E  (letters  F  is  contained  in  letter  E).  In  figure  2a  and  2b,  we  show  the  original 
and  the  modified  phase  difference  weighted  histogram,  with  N  equal  to  100.  The 
corresponding  correlations  are  shown  in  figure  3a  and  3b.  By  comparing  these  figures  it  is 
shown  that  the  algorithm  proposed  in  this  paper,  enhances  clearly  the  discrimination 
capability  of  the  POF. 


E  F 


Fig.l.  Input  acme. 
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Fig.  3.  Correlation  planes,  a)  With  conventional  POF.  b)  With  POF  and  DC  optimization. 
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With  the  increasing  demands  and  complexity  of  signal  processing  systems,  renewed  interest 
emerged  in  the  set  theoretic  formulations  [1]  applied  to  optimization  problems  such  as  signal 
synthesis  in  pattern  recognition  [2,  3,  4],  Computerized  Tomography  [5],  constrained  deconvo¬ 
lution  [6],  Image  restoration  [7,  8]  etc.  The  task  is,  usually:  “Given  some  a  priori,  corrupted, 
signal,  h° ,  which  is  known  to,  supposedly,  have  satisfied  N  constraint  sets,  try  to  restore  it  to 
a  signal  which  will  satisfy  the  constraint  sets” .  The  framework  is  quite  similar  if  the  task  is  to 
design  a  signal  which  is  to  satisfy  N  constraints. 

Projection  methods  have  been  suggested  for  this  task.  A  projection  of  an  arbitrary  element, 
h,  onto  a  Closed  Convex  Set  C,  is  that  unique  element  in  the  set  closest  to  h,  where  “dose” 
is  measured  by  some  distance  function  d,-.  Namely, 

P£\{h)  -  h!  if  and  only  if  inf  di(hi,h)  ~  d,-(h',h);  h'  €  C, 

*  h\  €Cj 

.  Given  N  sets  C{  one  usually  constructs  operators  of  the  form  Tn+1  =  wn(i)Pct  where 
ton(»)  is  some,  iteration  dependent,  weight  function.  This  leads  to  a  sequence  hn+1  :=  Tn+1(hn). 
One  strives  to  show  that  the  sequence  generated  converges  to  a  solution  satisfying  all  constraint 
sets  which  means  that  lim  hn  6  Co  :=  fl^  C,-.  The  results  known  to  date  are  basically  composed 
of  the  following:  if  Co  ^  0  and  <U  are  constant  for  all  sets  involved  and  all  sets  C,  are  convex 
then  under  some  general  conditions  the  algorithm  converges. 

However,  what  happens  if  the  constraints  are  inconsistent  i.e.  Co  =  9  (which  is  a  frequent  oc¬ 
currence  due  to  detection  error,  slight  mis- characterization  of  the  set  etc.)?  Is  there  convergence 
at  all  and  if  so  to  what?  Also,  how  will  one  project  onto  a  constraint  when  the  constraint  is 
given  indirectly  (implidtly)?  This  may  be  transformed  in  many  instances  to  projection  onto  an 
explicit  set  but  then  the  distance  function  is  modified  ([4])  and  thus  not  all  distance  functions 
are  uniform,  what  then?  Finally,  in  many  Image  recovery  problems  more  than  2  constraints 
exist  (when  one  is  non  convex)  -  Can’t  we  indeed  use  a  projection  base  algorithm  for  this  case 
(classically  (8]  we  are  limited  to  2  sets  if  one  is  non  convex)? 
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Fortunately,  all  of  the  above  problems  may  be  solved  by  use  of  a  very  special  parallel  pro¬ 
jection  method.  It  amounts  to  performing  projections  of  the  current  estimate  onto  all  N  sets 
involved  and  then  taking  a  very  special  average  of  these  projections.  More  specifically: 

1.  Even  if  Co  is  empty,  the  algorithm  converges.  Moreover,  it  converges  to  the  best 
solution  possible  i.e.  that  minimizing  its  distances  (squared)  from  all  sets  involved. 

2.  Projections  onto  individual  sets  may  be  performed  w.r.t  different  distance  functions 
and  yet  convergence  is  assured  to  an  Element  in  Co,  or  to  the  closest  one  to  it. 
Thus  many  implicit  constraint  problems  may  be  converted  to  explicit  constraints 
with  various  distance  functions  which  can  be  handled  by  this  method. 

3.  The  algorithm  converges  for  an  arbitrary  number  of  non  convex  constraints  (the 
SDER  property  introduced  by  Stark  et  al  [8]  is  maintained).  Although  convergence 
is  not  necessarily  to  Co,  and  it  may  get  stuck  at  some  “trap”  point.  In  most  cases, 
however,  convergence  is  to  Co.  In  any  event  it  should  be  realized  that  the  traps  are 
a  result  of  the  non-convex  constraints,  not  the  projections  algorithm. 
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Abstract 

Diffraction  effects  and  photometric  non-uniformities  of  the  spatially  incoherent  shadow¬ 
casting  architecture  are  investigated  when  the  input  and  output  planes  are  pixelized.  The 
acceptable  geometrical  configurations  are  then  derived  precisely. 

Shadow-casting  systems  have  been  known  since  a  long  time,  and  were  first  used  to  analyse 
crystal  lattices  [1,2].  They  provide  a  very  simple  way  to  obtain  the  convolution  or  correlation 
of  bi-dimentional  positive  functions  directly  in  object  space  [3].  They  have  also  been  used 
for  implementing  optical  logic  [4]  and  image  morphology  [5]  operations,  as  well  as  symbolic 
substitution  [6].  The  simplest  shadow-casting  system  is  depicted  in  fig.  1,  and  is  composed  of 
three  planes.  The  first  one  is  a  diffuse  plane  source  which  can  consist  for  example  of  a  CRT,  or 
a  Spatial  Light  Modulator  (SLM)  followed  by  a  diffuser  illuminated  by  a  primary  source.  In  a 
second  plane  at  a  distance  d  is  another  SLM,  and  in  a  third  plane  at  a  distance  p  is  a  screen 
or  an  array  detector.  The  distance  d  between  the  SLMs  is  always  positive,  but  the  distance  p 
between  the  second  SLM  and  the  detector  can  be  either  positive  or  negative,  in  which  case  the 
third  plane  has  to  be  observed  with  an  additional  lens. 

The  basic  system  of  fig.  1  can  be  generalized  [7, 8, 3]  by  incorporating  a  lens  (or  lenses) 
between  the  source  and  intermediate  planes,  and  between  the  intermediate  and  detector  planes. 


SLM1  Diffuser  SLM2  CCD  Camera 


Collimated 

Source 


Figure  1:  lensless  shadow¬ 
casting  architecture. 


The  first  plane  being  a  diffuse  source,  it  is  spatially  incoherent,  and  the  system  is  gener¬ 
ally  assumed  to  behave  approximately  according  to  geometrical  optics  (depending  on  ratios  of 
caracteristic  lengths  in  the  set-up  to  the  wavelength(s)  used).  Even  with  that  assumption,  the 
optical  distances  are  different  for  horizontal  or  tilted  rays  between  the  SLMs.  This  phenomenon 
has  important  consequences  on  the  correlation  operation  since,  due  to  photometric  effects,  the 
optical  flux  is  dependent  on  the  tilting  of  the  rays.  The  consequence  may  be  a  loss  of  translation 
invariance,  and  a  degradation  of  the  measured  correlation  in  comparison  with  the  mathematical 
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correlation  operation.  This  point  is  in  general  emphasized  by  the  imperfect  diffuser  placed  after 
the  first  SLM.  Taking  into  account  this  aspect  leads  to  a  first  domain  for  the  acceptable  values 
of  the  geometric  scaling  of  the  architecture. 

A  second  domain  for  the  acceptable  values  of  the  geomt\.:-  scaling  of  the  architecture  is 
obtained  if  diffraction  effects  are  analyzed.  Indeed,  the  geometrical  optics  description  is  a  first 
approximation,  and  diffraction  needs  to  be  taken  into  account.  Previous  analyses  [9, 10]  of  the 
effect  of  diffraction  on  resolution  did  not  describe  precisely  the  behavior  of  a  shadow-casting 
system  in  presence  of  discrete  elements  such  as  the  pixelized  SLM. 


G 


Figure  2:  definition  of  acceptable  do¬ 
main  for  the  geometrical  parameters  of  the 
shadow-casting  architecture. 


Furthermore,  if  a  CCD  is  used  in  the  output  plane,  the  continuous  output  is  as  well  sampled. 
Then  the  system  is  an  image  processor,  for  which  there  exist  discrete  relations  between  output 
and  input  images. 

The  results  of  our  theoretical  analysis  of  diffraction  and  photometric  effects  will  thus  be  pro¬ 
vided  as  a  transformation  between  discrete  images  (the  inputs  and  the  output).  The  acceptable 
choices  of  the  geometric  scaling  of  the  architecture  are  then  rigourously  defined.  This  acceptable 
domain  is  shown  in  fig.  2,  where  the  ratio  G  is  defined  by:  G  =  \d  +  p|/|p|.  The  influence  of 
diffraction,  which  was  only  addressed  previously  with  rough  estimations,  had  led  to  pesssimistic 
predictions  as  will  be  shown  in  the  presentation.  Finally  we  will  discuss  the  practical  implica¬ 
tions  on  the  architecture  of  these  considerations  in  relation  with  realistic  applications.  Indeed, 
it  will  be  shown  that  classical  situations  in  which  the  reference  image  is  smaller  than  the  input 
image  are  favorable. 
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Abstract 

The  principles  of  time-domain  holography  of  short  laser 
pulses  based  on  wave  interactions  in  spectrally  nonselective 
medium  are  considered.  Using  these  principles  the  system  of 
methods  may  be  proposed  for  flexible  controlling  and 
processing  fast  optical  signals. 

Summary 

He  review  the  principles  underlining  the  possibilities  of 
recording  and  reconstruction  of  time-dependent  radiation  (wave 
packets) ,  while  using,  as  the  normal  holography,  interference 
and  diffraction  of  waves,  and  spectrally  nonselective 
recording  media.  These  principles  allow  to  develop  the  system 
of  methods  for  flexible  controlling  and  processing  fast 
optical  signals  ranging  in  duration  from  femtoseconds  to 
nanoseconds . 

From  the  methods  proposed  the  most  promising  one  is  the 
holography  of  time-varying  signals  based  on  spectral 
decomposition  of  light  (spectral  holography) .  In  this  case  we 
use  interference  and  diffraction  of  waves  formed  during 
spatial  spectral  decomposition  of  short  light  pulses  (spectral 
decomposition  waves)  by  a  normal  spectral  device. 

Recording  interference  pattern  of  spectral  decomposition 
waves  of  two  optical  pulses  (a  signal  and  a  reference  pulse) 
allows  to  store  the  temporal  optical  signal  (the  signal  pulse) 
in  the  form  of  a  spectral  hologram.  The  reconstruction  of  the 
stored  signal  in  the  form  of  its  real  replica  occurs  by 
diffraction  of  the  spectral  decomposition  wave  of  a  reference 
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puls*  fro*  the  spectral  hologram.  It  Is  also  possible  to 
obtain  a  time-reversed  replica  of  the  stored  signal. 

Spectral  holography  serves  the  basis  for  time-domain 
Fourier  optics  that  can  be  developed  using  the  analogies  with 
the  normal  Fourier  optics  and  Fourier  transform  holography. 

The  possible  applications  of  spectral  holography  are 
analysis,  synthesis,  time-reversal,  recognition,  optimal 
filtering,  and  optimal  compression  of  short  temporal  optical 
signals. 

The  methods  of  dynamic  spectral  holography  and  spectral 
nonlinear  optics  are  also  proposed.  Spectral  decomposition 
waves  of  several  light  pulses  may  be  superimposed  in  a  common 
dynamic  light-sensitive  medium  (optical  nonlinear  medium) . 
That  makes  possible  the  nonlinear  interaction  of  spectral 
decomposition  waves  and,  also,  the  interaction  of  spectral 
decomposition  waves  with  inhomogeneous  monochromatic  waves.  As 
a  consequence,  various  real-time  transformations  of  ultrashort 
temporal  signals  become  possible.  In  particular,  it  is 
possible  to  nix  spatial  and  temporal  signal  and,  using  such 
mixing,  to  obtain  fast  space-to-time  and  time-to-space  signal 
conversions . 

Methods  of  dynamic  spectral  holography  and  spectral 
nonlinear  optics  may  be  used  for  highly  parallel  processing  of 
optical  data  streams.  In  particular,  dynamic  space-time 
conversions  may  be  used  for  for  time-  and  code-division 
multi/demultiplexing  of  broad-band  data  streams  and  fast 
optical  vector-matrix  multiplications  in  communication  and 
computing  systems. 

The  experimental  results  on  recording,  reconstruction, 
time-reversal,  and  holographic  recognition  of  ultrashort 
optical  pulses  by  spectral  holography  methods  are  presented. 
Also  presented  are  the  experimental  results  on  real-time 
conversion  of  ultrashort  optical  pulses  into  the  spatial 
signals  using  three-wave  nonlinear  interaction  of  spectral 
decomposition  waves  and  monochromatic  waves. 

See  also:  Yu.T.Mazurenko,  Opt.  Eng.,  V.  31,  739-749 
(1992)  . 
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Abstract.  The  concept  of  "spatial  amplification"  based  on  transverse 

effects  in  optical  bistability  is  proposed.  The  possibility  is  shown 

experimentally  to  detect  with  the  help  of  "spatial  amplifier"  weak  optical 

7  8 

signals  with  the  power  that  is  10  -10  tines  less  than  the  power  of  noise. 

In  present  contribution  the  concept  is  proposed  for  increasing  the 
signal-to-noise  ratio  in  2D  optical  information  processing  systems  which 
is  based  on  application  of  transverse  effects  in  optical  bistability.  This 
concept  allows  to  develop  the  technique  for  identification  of  separate 
information  pixels  in  wlde-aperture  2D  systems  for  information  processing. 

The  basic  idea  of  the  above  concept  is  to  provide  the  favorable 
conditions  for  the  rise  and  propagation  of  switching  waves  in  optical 
bistable  layers  if  the  switch-on  threshold  has  been  exceeded  at  least  in 
one  local  area.  This  leads  to  a  many-fold  increase  of  output  light  power 
up  to  the  highest  value  which  corresponds  to  the  total  switching  of  the 
whole  bistable  layer. 

In  a  number  of  algorithms  for  optical  information  processing  one 
needs  to  detect  the  presence  or  absence  of  information  at  least  in  one 
pixel  of  wide-aperture  matrix.  Since  the  "0"-level  signal  in  optical 
systems  is  not  equal  to  zero,  the  power  of  background  signal  coming  from 
the  whole  matrix  is  many  times  higher  than  the  power  of  information  signal 
In  an  only  separate  pixel. 

Using  OB-matrix  formed  on  an  optically  uniform  nonlinear  layer  one 
can  switch  on  not  only  the  specified  pixel  but  its  nearest  surrounding 
as  well,  and  thus  the  resulting  out',  -t  signal  to  be  registered  can  be 
increased  many-fold. 
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The  estimates  of  the  gain  factor  of  such  a  "spatial  brightness 
aaplifier",  defined  as  the  ratio  of  the  output  light  power  change  to  the 
power  of  the  optical  lnforaatlon  signal  that  has  caused  this  change,  give 
the  values  of  107-loV 

The  technique  proposed  has  been  proved  by  aodellng  the  "spatial 
brightness  amplifier"  experimentally  using  a  thin-fils  bistable 
ZnS- interferometer.  Basic  results  are  presented  on  Figures  1,2. 

Fig.  1  shows  background  light  distribution  of  the  whole  illuminated 
area  (1),  intensity  profile  of  the  signal  bean  (2),  and  the  output 
intensity  profile  (3)  which  gives  the  evidence  of  the  signal  somewhere 
within  the  switched  area.  On  Fig.  2  time  dependencies  are  plotted  for  the 
power  of  output  light  beam  when  there  is  only  background  and  no  signal 
(1),  for  the  power  of  the  signal  beam  to  be  detected  on  the  above 


r,  pm 


Power,  arb.unlU 


background  (2),  and  power  of  the  output  beam  while  detecting  local  signal 
with  the  maximum  intensity  that  makes  up  10X  (3)  and  20%  (4)  of  the 
background.  The  experimentally  obtained  gain  factor  of  our  "spatial 
brightness  amplifier"  of  about  5  103  has  been  limited  by  the  power  of  the 
laser  available. 

The  concept  of  "spatial  brightness  amplification"  has  been  appl led 
also  to  analyze  output  light  fields  of  the  coherent  correlation  processor. 
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Abstract  A  new  form  of  heterodyning  in  an  all-optical  bistable  system,  related  to  stochas¬ 
tic  resonance,  in  which  a  heterodyne  signal  can  be  enhanced  by  adding  noise,  is  reported. 

The  phenomenon  of  stochastic  resonance  (SR)  in  which  a  signal-to-noise  ratio  (SNR) 
can  be  increased  by  adding  noise,  [1]  has  attracted  much  attention  recently  (see  [2])  . 
Most  of  the  data  have  been  obtained  for  bistable  systems  driven  by  noise  and  by  a  low- 
frequency  periodic  force.  However,  the  noise-induced  increase  of  the  SNR,  and  also  the 
frequency-selective  response  of  bistable  systems,  make  it  interesting  to  apply  the  idea  of 
SR  to  heterodyning  in  which  two  high-frequency  fields  (an  input  signal  and  a  reference 
signal)  are  mixed  nonlinearly  to  generate  a  heterodyne  signal  at  the  difference  frequency. 

In  this  paper  we  report  and  discuss  a  new  form  of  the  phenomenon,  noise-enhanced 
optical  heterodyning.  The  enhancement  rather  than  suppression  by  noise  of  the  hetero¬ 
dyne  signal  has  been  investigated  theoretically,  experimentally  and  by  means  of  analogue 
electronic  simulation. 

We  are  investigating  a  double-cavity  membrane  system  (DCMS)  which  displays  all- 
optical  bi-  and  multistability  [3].  To  a  reasonable  approximation  [3(a)]  the  dynamics  of 
the  DCMS  driven  by  two  modulated  laser  beams  at  different  wavelengths  is  described  by 
a  Debye  relaxation  equation  for  the  phase  gain  <f> 

4  +  -(*  -  *o)  =  ( Io  +  Iin  cos(u*  +  fl)t  +  6I(t))M(<(>)  +  /re/(co8Wbt  +  1)  (1) 

T 

where  is  the  phase  of  the  DCMS  in  the  “dark”;  6I(t)  is  a  zero-mean  random  term  in  the 
modulated  light  intensity;  and  7re/  and  /,„  are  the  amplitudes  of  the  periodic  modulation 
of  the  beams.  The  frequency  detuning  is  assumed  to  be  small,  |fl|  C  t-1  <  wo.  The 
function  M(<j>)  determines  the  nonlinearity  of  the  response  of  the  DCMS.  It  is  convenient 
to  rewrite  (1)  in  the  form 

x  +  U'{x)  =  IrejF(x)  cos  wot  +  Itn  cos(u\,  +  R)f  +  61{t)  (2) 

*=  /*X>  d<t>M-'(<t>y,  U(x)  =  Iox+  I™  d<}>{ITt,  +  {4>-<h))M-2{<t>),  (3) 

Jo  Jo 

(F(x)  s  M~l(j>(x))).  Eq.  (2)  is  a  stochastic  differential  equation  for  the  variable  x  driven 
by  multiplicative  reference  (oc  Iref)  and  additive  input  (a  /,„)  signals.  It  is  the  signal  at 
frequency  R  that  is  to  be  revealed  via  heterodyning. 

In  view  of  the  possible  applications  we  will  allow  for  a  zero-mean  Gaussian  noise 
SI(t)  in  (1)  that  consists  of  two  independent  components,  at  low-  and  high-frequencies 
respectively: 


MP25/78 


sm  =  ft  At)  +  A/M,  A/M  =  Re  (A/(<)  «cp(— *WoO)  , 


(4) 


where  both  /»/(<)  and  /*/(t)  have  a  cut-off  frequency  w  ~  we  >  s  U"(x i,j)  (xt^  are 
the  stable  states  of  the  system  in  the  absence  of  noise  and  periodic  drive). 

For  mq  >  r~l  the  motion  consists  of  fast  oscillations  at  frequency  wq  superimposed  on 
a  slower  motion,  and  the  heterodyning  can  be  characterized  by  the  low-frequency  signal 
at  the  output,  x^(t)  =  x(t)  (the  overbar  stands  for  averaging  over  the  period  2x/u>q).  In 
the  spirit  of  [4],  to  first  order  in  the  slow  part  of  the  coordinate  is  given  by 

*<•« +  {/'(*<*'>)  =  .4^5  unfit  +  /<">((),  A  =  (5) 

The  predictions  of  this  theory  have  been  tested  quantitatively  by  analogue  simulation 
for  the  simplest  nontrivial  case  of  overdamped  motion  in  the  symmetric  Duffing  potential, 
for  F(x)  =  x.  In  this  simulation  noise-enhanced  heterodyning  has  been  demonstrated  for  I 

both  types  of  noise,  in  good  agreement  with  theory.  The  SNR  dependence  on  n.,  n„, 

Wo1  and  ft  were  investigated  for  white  noise. 

The  DCMS  consists  of  a  thin  GaSe  semiconductor  film  separated  from  a  dielectric 
mirror  by  a  metal  diaphragm.  The  incident  radiation  from  the  Ar  laser  providing  the 
input  signal  is  propagated  normally  to  the  mirror  and  is  modulated  by  superimposed  < 

random  and  periodic  (at  frequency  Wb  ±  ft)  signals  with  an  electro-optical  modulator.  It 
had  been  shown  previously  that  this  system  exhibits  standard  SR  for  c  r”1,  ft  =  0 
[5]. 

We  have  now  shown  experimentally  that,  in  the  presence  of  additional  radiation  from  a 
HeCd  laser  modulated  by  a  chopper  to  induce  periodic  heating  of  DCMS  at  the  frequency  < 

of  reference  signal  ljq,  this  system  displays  noise-enhanced  optical  heterodyning  for  u*j 
r-1  >  ft.  It  has  been  found  in  particular  that  there  is  a  range  of  noise  intensity  where 
the  SNR  of  the  heterodyne  signal  at  ft  is  strongly  increased  by  adding  noise.  Below  and 
above  this  range  the  SNR  decreases  with  the  increasing  noise  intensity  D.  This  behaviour 
coincides  with  that  observed  in  standard  SR,  cf.  [5].  < 

A  semiquantitative  description  of  this  new  phenomenon  is  provided  by  Eqs.  (l)-(5), 
with  an  absorption  coefficient  in  the  form  given  in  [3(a)].  A  more  detailed  comparison 
with  the  theory  will  require  computer  simulation  of  the  dynamics  of  the  DCMS. 

The  work  was  supported  by  the  Science  and  Engineering  Research  Council,  by  the 
European  Community,  by  the  Royal  Society  (London),  and  by  the  Gosstandart  of  Russia.  * 
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Information  is  related  to  unexpectedness  and  nonredundancy.  Efficient  processing  of 
information  should  use  minimum  redundancy.  I  show  that  famn/f snout  can  minimiae 
computational  load  for  algorithms  whose  touch  complexities  are  less  than  their  computational 
complexities. 


Summary 

There  are  many  algorithms  for  most  complex  operations  such  as  solving  Ax  “b.  The  bases 
for  preference  in  digital  computing  are  usually  computational  complexity  and  match  to  die 
available  hardware.  For  analog  optical  computing  we  should  ask  die  same  questions  of  its 
algotecture/archirithms. 

In  conventional  digital  computing,  only  die  computational  complexity  of  algorithms  is 

considered.  For  instance.  Ax  -  b  sedation  is  an  o(N3)  problem.  Doubling  die  size  of  A 
requires  eight  times  more  calculations. 

Recendy  computer  scientists  have  started  to  attend  to  die  match  of  algorithm  to  computer. 
This  has  led  to  die  concept  of  "touch  complexity".  This  is  a  measure  of  how  many  times  we 
must  go  to  the  memory  to  get  the  information  needed  for  a  given  algorithm.  For  some 
algorithms  and  some  problems,  the  touch  complexity  is  less  than  the  computational 

complexity.  Fdr  example,  the  touch  complexity  for  some  Ax  «b  solvers  is  o(N2). 

I  now  propound  a  diesis  which  I  have  not  proved  but  whose  probable  truth  I  will  argue  by 
showing  several  examples.  It  is  dus:  "The  space- time  complexity  of  an  analog  optical 
processor  can  always  be  driven  to  the  touch  complexity". 

The  way  we  do  this  is  with  fanin/fanoqt  That  is  the  datum  to  be  used  multiply  is  represented 
as  an  SLM  pixel  (or  its  equivalent)  and  all  operations  on  that  datum  are  performed 
simultaneously.  I  will  illustrate  this  with  two  special  cases,  leaving  die  proof  of  the  thesis  to 
others  or  for  me  at  a  later  time. 


First,  I  consider  Ax  =  b:  a  problem  of  0(n3)  computational  conqdexity  and  0(n2)  touch 
complexity.  I  will  show  that  this  can  be  solved  in  analog  optics  with 


S = 0(n2)  spatial  complexity  , 
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T  =  0(n°)  temporal  complexity  and  , 

ST  =  o(nj  )  space-time  complexity  . 

The  other  0(N)  comes  fifom  fanin/fanout. 

Second,  I  consider  matrix-matrix  multiplication:  a  problem  of  0(N3)  computational 
complexity  and  o(n3)  touch  complexity.  Yet,  I  can  solve  it  by  analog  optics  with 

s=o(n2)  , 

T  =  0(N°)  .and 

st=o(n2)  . 

The  remaining  Q(N)  comes  from  fanin/fanout. 

My  hope  is  that  this  work  can  give  us  insight  into  Much  problems  are  best  suited  for  analog 
optical  computation. 
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Abstract 

A  parallel  sorting  algorithm  and  its  implementation  on  a  3-D  multistage  optoelectronic  sorting  network 
are  presented.  The  network  operates  on  bit-slices  of  wards  and  combines  regular  structure  and  simple 

interconnections  between  stages. 


Introduction 

Tomographic  (bit-sliced)  sorting  is  a  parallel  pipelined  algorithm  that  can  be  mapped  directly  on  a  3-D  com¬ 
puter  architecture  that  uses  optical  interconnections  to  propagate  data  and  control  bits  from  one  processing  plane 
to  another.  The  algorithm  is  a  combination  of  the  odd-even  transposition  sort  and  the  standard  radix  sort  [1].  In 
this  paper,  we  describe  the  algorithm  and  the  structure  and  functionality  of  the  3-D  sorting  network.  We  also  dis¬ 
cuss  how  this  network  can  be  implemented  as  a  multistage  architecture  similar  to  the  3-D  optoelectronic  computer 
under  development  at  the  Optoelectronic  Computing  Systems  Center  in  Colorado.  Tomographic  sorting  can  serve 
as  a  test  application  for  this  computer. 


Hardware  Description 

The  3-D  optoelectronic 
computer  under  development 
comprises  multiple  2-D  stages 
of  electronic  processing  element 
arrays,  stacked  in  a  3-D  forma¬ 
tion  (Figure  1).  Each  element  in 
a  2-D  array  can  accept  optical  in¬ 
put  on  a  phototransistor,  process 
the  incoming  signal  electroni¬ 
cally,  and  generate  an  optical 
output  by  driving  a  vertical-cav¬ 
ity  surface-emitting  laser 
(VCSEL).  The  VCSEL’s  output 
becomes  input  for  the  next  stage. 

Computer-generated  holograms 
can  implement  the  interconnec¬ 
tion  patterns  required  for  each 
operation. 

The  tomographic  sorting  network  (TSf I)  operates  on  a  list  of  N  words  with  M  bits  each  and  has  M  processing 
stages.  A  2-D  version  of  a  TSN  is  shown  in  Figure  2.  The  N  words  are  stored  as  M  bit  slices  with  die  k-th  slice 
containing  thek-th  significant  bit  of  every  word.  All  the  bit  slices  are  pipelined  into  the  network,  and  remain  there 
for  a  fixed  number  of  clock  cycles  until  they  are  sorted  and  ready  to  be  transferred  out  and  into  a  buffer.  Sorting 
is  performed  by  rearranging  bits  on  the  same  slice  so  that,  eventually,  words  are  reshuffled  and  end  up  in  ascending 
order  from  top  to  bottom.  After  the  bit  slices  are  loaded  into  the  sorting  unit,  only  control  signals  are  transmitted 
from  one  stage  to  another  to  indicate  any  necessary  bit  exchanges.  Two  types  of  cells  are  present  in  the  TSN:  data 
and  control  cells.  Each  cell  contains  a  flip-flop  to  hold  the  value  of  a  bit  Rows  of  data  cells  are  interleaved  with 
rows  of  control  ceils  as  shown  in  Figure  2.  The  control  cells  are  basically  bitwise  compare-rnd -exchange  modules 
with  some  additional  control  lines.  Optical  signals  propagate  along  the  horizontal  direction,  while  electronic  sig¬ 
nals  traverse  along  die  vertical  direction. 

A  column  of  data  and  control  cells  constitutes  a  stage.  The  data  cells  in  every  stage  will  receive  and  hold 
the  contents  of  the  corresponding  bit  slice.  Notice  that  all  signals  flow  from  left  to  right  and  that  the  stages  are 
numbered  in  descending  order  from  M  to  1  (from  left  to  right)  to  coincide  with  the  bit  order  in  a  word.  The  number 
of  data  cells  on  each  sage  is  equal  toN,  while  the  number  of  control  cells  is  N-l.  The  number  of  stages  that  are 
required  is  equal  to  the  length  of  each  wont  in  bits.  The  interconnection  pattern  between  any  pair  of  stages  is  a 
straightforward  one-to-one  mapping.  As  we  we  going  to  show  Utter,  the  3-D  version  of  this  sorting  network  retains 
these  regular  interconnections  which  can  be  realized  optically  using  lensiet  arrays  between  two  stages.  Another 
characteristic  of  this  network  is  its  regular  structure  which  also  contributes  to  a  simple  implementation  and  permits 
easy  scaling  to  accommodate  lager  sett  of  words. 


Figure  1.  Schematic  of  the  3-D  optoelectronic  architecture 
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Sorting  Algorithm 

As  mentioned  earlier,  the  sorting  algorithm  is  a  combination  of  the  odd/even  transposition  and  the  radix  sort 
algorithms.  The  process  is  initiated  by  looking  at  the  most  significant  bit  slice  and  trying  to  partition  it  into  two 
groups:  the  O-group  at  the  top  and  the  1-group  at  the  bottom.  Every  pair  of  adjacent  bits  that  are  not  equal  allows 
a  decision  on  the  relative  mag¬ 
nitude  of  two  words  to  be 
reached.  If  the  two  words  must 
be  exchanged,  the  process  is 
completed  in  a  pipelined  fash¬ 
ion  with  an  unconditional  ex¬ 
change  control  signal  gradual¬ 
ly  forcing  all  the  lower  signifi¬ 
cance  bits  of  the  two  words  to 
exchange  positions.  If  a  deci¬ 
sion  cannot  be  reached  at  the 
MSB  slice  because  the  two  bits 
under  consideration  are  equal, 
the  decision  is  deferred  to  the 
next  most  significant  bit  slice 
by  sending  a  conditional  ex¬ 
change  control  signal  to  the 
next  stage.  This  will  force  the 
next  control  cell  to  perform  a 
bit  comparison  to  determine 
the  relative  magnitude  of  the 
two  words.  While  an  uncondi¬ 
tional  or  conditional  exchange 
signal  ripples  through  all  the 
stages  of  the  network,  a  new  set 
of  tests  can  be  initiated  at  the 
MSB  slice.  The  outcome  of 
these  tests  and  the  subsequent 
execution  of  any  action  re¬ 
quired  will  not  affect  the 
completion  of  the  previously 
initiated  exchanges  that  may 
still  continue  to  occur  towards 
the  final  stages  of  the  pipeline. 

Three  distinct  phases  are  present  in  the  tomographic  sorting  operation:  (i)  data  loading,  during  which  the 
bit-slices  are  pipelined  into  the  sorting  stages;  (ii)  sorting,  and  (iii)  data  unloading,  during  which  the  bit  slices  of 
sorted  words  exit  the  pipe. 

Discussion 

Despite  the  fact  that  special  purpose  numerical  sorting  processors  and  networks  have  relaxed  the  burden  of 
sorting,  when  the  size  of  these  networks  becomes  large,  the  latency  of  the  data  input/output  process  and  the  com¬ 
plexity  of  the  interconnections  between  stages  become  serious  bottlenecks  that  render  their  implementation  im¬ 
practical.  An  optoelectronic  sorting  network  such  as  the  one  we  presented  in  this  paper,  using  straight  pass  optical 
interconnections  for  interstage  communications,  allows  for  a  simpler  implementation.  Although  the  TSN  is  some¬ 
what  slower  when  compared  with  other  sorting  networks  such  as  the  bitonic  sorting  network,  it  is  simpler  to  imple- 

The  optical  bitonic  sorting  network  [2]  becomes  prohibitively  complex  as  the  number  of  words  to  be  sorted 
increases.  The  reduced  space  complexity  of  the  TSN  is  its  main  advantage  over  other  sorting  networks. 

We  plan  to  design  and  fabricate  a  2-D  tomographic  sorting  unit  in  a  VLSI  chip  and  experiment  with  a  smaller 
version  of  a  3-D  architecture  using  mays  of  VCSELs  as  sources  and  heterostructure  phototransistors  as  detectors. 
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Abstract.  An  optoelectronic  implementation  of  bitonic  sorting  is  presented  which  uses 
a  recirculating  architecture  to  reduce  the  required  number  of  stages  to  two.  This 
architecture  decreases  the  mechanical  complexity  at  the  cost  of  system  capacity. 


While  a  wide  variety  of  sorting  algorithms  have 
been  explored  for  implementation  in  both  hardware 
and  software,1  only  die  implementation  proposed  by 
Stale  and  Athale2  utilized  the  parallelism  associated 
with  optoelectronic  processing  arrays.  Their 
pipelined  architecture  was  based  on  the  bitonic  sorting 
network  shown  in  figure  1.  Each  stage  of  the  sorting 
network  was  comprised  of  latching  compare-and- 
exchange  (CAE)  modules  which  performed  a  bitwise 
comparison  on  two  input  words.  The  stages  were 
connected  with  optical  perfect  shuffle  interconnection 
networks.  Since  the  number  of  processing  stages 
required  to  implement  a  pipelined  sorting  system  is 
dependent  on  the  number  of  words  to  be  sorted,  the 
realization  of  a  large  capacity  pipelined  sorter  may  be 
limited  by  a  maximum  achievable  mechanical  com¬ 
plexity  (defined  by  Cheng  et  al.  as  the  work  required 
to  build  an  optoelectronic  system).3 


Figure  1.  Bitonic  sorting  network. 

This  paper  proposes  an  optoelectronic  sorting 
implementation  which  reduces  the  number  of  required 
stages  to  only  two  and  thus  the  system  has  a  low 
mechanical  complexity.  Figure  2  shows  the  system 
layout  of  this  sorter  which  is  comprised  of  the  two 


Figure  2.  Recirculating  bitonic  sorter. 

stages  an  opposite  sides  of  s  beam  splitter.  This 
arrangement  is  compact  and  the  beam  splitter  provides 
a  means  for  convenient  optical  input  and  output. 
Each  stage  of  this  recirculating  system  consists  of 
CAE  modules  which  compare  adjacent  elements  of 
the  input  set.  The  optical  outputs  of  one  stage  pro¬ 
vide  optical  inputs  for  the  other  stage.  Thus,  the  data 
can  be  transmitted  optically  between  the  processing 
stages  through  the  same  interconnection  network  used 
to  implement  the  pipeline  architecture. 

We  are  presently  fabricating  a  4  x  4  demonstra¬ 
tion  of  tins  sorting  implementation  which  will  be 
presented  at  the  conference.  The  bitwise  CAE  mod¬ 
ule*  in  each  stage  are  based  on  the  hybrid  integration 
of  silicon  photodetectors,  CMOS  logic,  and  vertical 
cavity  surface  emitting  lasers  (VCSEL).  In  addition, 
this  paper  discusses  fabrication  reliability,  capacity 
and  mechanical  complexity  issues. 
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This  recirculating  architecture  requires  that  each 
array  hold  all  of  the  data  bits  for  each  of  the  words  to 
be  sorted.  Thus,  the  data  are  available  for  word-wise 
CAE  modules.  However,  the  simplicity  of  bitwise 
CAE  modules,  which  operate  on  pairs  of  input  bits, 
is  desirable.  In  order  to  compare  m-bit  words  with 
bitwise  CAE  modules,  m-bitwise  CAE  modules  are 
electrically  together  on  a  single  chip.  These 

electrical  comoctious  are  used  to  communicate  control 
signals  such  as  die  exchange  status  and  exchange 
criteria. 

While  die  proposed  recirculating  architecture 
reduces  the  number  of  required  processing  stages  to 
two,  the  system  capacity  (the  number  of  words  that 
can  be  processed  in  parallel)  is  also  reduced  unless 
the  size  of  each  stage  is  increased.  Thus,  there  is  a 
hardware  trade  off  between  the  pipelined  and  recircu¬ 
lating  architectures.  The  former  requires  many  stages 
while  the  latter  requires  two  large  stages. 

Figure  3  plots  the  system  capacity  as  a  function  of 
array  sin  for  both  architectures.  As  expected,  the 
pipeline  architecture  exhibits  significantly  larger 
system  capacity  due  to  the  n  *  dependence  associated 
with  the  bit  aerial  data  flow.  The  capacity  of  the 
recirculating  architecture  is  dependent  on  die  word 
length,  m.  Figure  3  shows  the  capacity  for  m  =  8, 
16,  32,  and  n  . 


Figure  4.  Capacity  of  recirculating  sorter. 

While  it  is  possible  to  increase  system  capacity  by 
utilizing  larger  arrays,  available  fabrication  technolo¬ 
gy  will  set  a  maximum  achievable  limit  on  array  size. 
In  order  to  overcome  this  limitation  on  system  capaci¬ 
ty,  it  is  possible  to  implement  each  stage  of  the 
recirculating  architecture  with  multiple  arrays. 
However,  this  increases  die  mechanical  complexity 
of  die  system. 

Figure  4  plots  the  number  of  arrays  required  to 
implement  a  bi tonic  sort  as  a  function  of  maximum 
array  size  (and  sorter  capacity)  for  both  architectures. 
For  comparison  it  is  assumed  that  system  capacity  is 


the  —me  for  both  architectures.  The  shaded  portion 
of  figure  4  indicates  the  region  where  the  recirculating 
architecture  requires  fewer  arrays  than  the  pipeline 
architecture.  Note  that  for  word  lengths  shorter  than 
64  bits,  the  number  of  arrays  required  for  the  recircu¬ 
lating  system  is  always  less  that  the  number  required 
for  the  pipeline  system. 


Capacity  (Wonts) 


Figure  3.  Arrays  required  to  implement  bitonic  sorter. 

Stmunary 

We  have  presented  an  optoelectronic  sorter 
based  on  a  recirculating  architecture.  Since  the 
recirculating  architecture  requires  only  two  processing 
stages,  the  mechanical  complexity  of  this  implementa¬ 
tion  is  lower  than  pipeline  implementations.  Each 
stage  of  die  recirculating  system  is  comprised  of 
bitwise  CAE  modules  which  use  electrically  transmit¬ 
ted  control  signals  and  optical  data  transmission.  We 
are  presently  fabricating  a  demonstration  system 
which  will  sort  four  4-bit  words.  The  bitwise  CAE 
modules  used  in  this  demonstration  will  be  based  on 
hybrid  integration  of  silicon  and  VCSEL  technologies. 
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Abstract 

The  experimental  implementation  of  an  optoelectronic  parallel  processing  system  is  described.  A 
reconfigurable  computer  generated  hologram  is  used  to  provide  a  programmable  interconnection 
kernel  and  simple  cellular  processing  operations  are  demonstrated. 


Optical  interconnections  offer  an  attractive 
solution  to  the  interconnection  bottleneck  which 
is  encountered  within  high-speed  parallel 
processing  systems.  In  the  case  of  single 
instruction  multiple  data  (SIMD)  architectures 
the  interconnection  mapping  between  the 
processing  elements  is  shift-invariant.  This  has 
led  to  interest  in  the  use  of  Fourier  plane 
holograms  to  perform  this  operation.  We  have 
previously  described  the  design  of  an 
optoelectronic  processing  system  in  which  a 
programmable  interconnection  is  obtained  by  the 
use  of  a  reconfigurable  computer  generated 
hologram  (CGH)  [1].  In  this  paper  we 
demonstrate  simple  cellular  processing 
operations. 

The  structure  of  the  optoelectronic  cellular 
processing  system  is  shown  in  Fig.  1.  The 
processing  plane  consists  of  an  array  of 
optoelectronic  processing  elements  (PEs),  each  of 
which  receives  optical  input  via  a  photodetector 
(PD)  and  provides  an  optical  output  via  a  light 
emitting  diode  (LED).  The  design  of  the 
processing  elements  is  based  upon  that  recently 
described  by  Ishikawa  et  al  [2].  Each  PE  can 


perform  simple  bit-serial  8-bit  addition  and  logic 
operations.  The  PEs  are  interconnected  optically 
through  the  use  of  a  reconfigurable  Fourier  plane 
CGH.  The  output  from  the  LED  array  is  imaged 
onto  the  photodetector  array  via  the  two  lenses 
L.  These  form  a  4-f  optical  system  and  the 
reconfigurable  CGH  is  placed  in  the  Fourier 
plane.  The  interconnection  kernel  is  determined 
by  the  state  of  the  CGH. 

In  the  experimental  system  the  PE  array  has  been 
simulated  with  an  array  of  LEDs  together  with  a 
CCD  camera.  The  reconfigurable  CGH  has  been 
implemented  by  displaying  a  binary  CGH 
pattern  on  a  liquid  crystal  display  (LCD)  screen 
(containing  640  x  400  pixels).  This  is  then 
optically  reduced  onto  an  optically  addressed 
spatial  light  modulator  (OASLM).  This  is  a 
parallel  aligned  nematic  liquid  crystal  (PAL) 
SLM  manufactured  by  Hamamatsu  Photonics 
K.K.  and  has  a  resolution  of  50  Ip/mm  [3], 
allowing  pixel  sizes  of  10-20  pm  to  be  obtained. 

Several  simple  feature  detection  operations  have 
been  demonstrated  with  this  system.  A  binary 
CGH  was  designed  by  use  of  the  simulated 
annealing  algorithm  [4].  This  encoded  a 
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convolution  kernel  which  was  designed  to  give  a 
maximum  response  to  groups  of  three  isolated 
pixels  in  the  input  plane.  Bipolar  interconnection 
weights  were  achieved  by  using  alternative  rows 
in  the  output  plane  resent  positive  and 
negative  values.  Ever.  ...y  this  operation  may 
be  performed  by  the  use  of  an  array  of  bipolar 
photodetectors.  In  the  experimental  system  this 
was  emulated  by  measuring  and  subtracting  the 
intensities  in  alternate  rows  of  the  output  plane 
with  a  CCD  camera.  Several  input  patterns  were 
presented  via  the  LED  array  and  it  was  found 
that  the  system  was  able  to  reliably  differenHi»je 
groups  of  three  isolated  pixels  as  requir  V 
interconnection  accuracy  was  found  to  be  x21% 
for  this  operation.  This  errror  was  dominated  by 
the  relatively  large  non-uniformity  (16%)  of  the 
LED  array  intensites  at  the  exit  plane.  The 
remaining  5%  was  due  to  errors  in  the  CGH. 

By  updating  the  hologram  pattern  on  the  LCD 
the  interconnection  kernel  may  be  modified,  thus 
allowing  a  programmable  SIMD  processing 
system  to  be  obtained.  Further  work  is  required 
to  reduce  the  interconnection  errors  in  the 
experimental  system  and  to  improve  the  high 
insertion  loss.  This  is  caused  by  the  small 
hologram  aperture  together  with  the  long  focal 
lengths  (2m)  of  the  Fourier  transform  lenses  and 
the  Lambertian  nature  of  the  LEDs.  We  will 
present  techniques  by  which  the  performance  of 
this  system  may  be  improved  and  will  consider 
issues  of  scalability.  We  will  also  demonstrate 
that  analogue  interconnection  weights  may  be 
achieved  by  the  use  of  this  approach. 


Processing 


Reconflgurable 

CGH 


Figure  1.  A  looped  optoelctronic  cellular 
processing  system:(PD  -  photodetector,  PE  - 
processing  element,  BS  -  beam  splitter,  L  - 
Fourier  transform  lens). 
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ABSTRACT  ( 

Two  digital  optical  computing  architectures  are  being  developed  for  compute  intensive 
applications.  A  32-bit  digital  optical  processor  has  been  implemented  in  hardware.  Develop¬ 
ment  efforts  are  underway  to  design  miniature  high  performance  optoelectronic  computing 
(HPOC)  modules. 

INTRODUCTION  < 

Two  digital  optical  processor  designs  are  being  developed  for  hardware  implementation. 

A  second  generation,  32-bit,  digital  optoelectronic  computer  (DOC  II)  prototype  has  been 
demonstrated  at  peak  speeds  of  up  to  1012  binary  operations  per  second.  DOC  II  is  fully 
programmable  and  operates  in  a  UNIX  environment  running  RISC  microcode.  The  64  channel 
prototype  operates  at  the  5000  photon  per  gate  level,  which  is  equivalent  to  1.2  femtoJoules  (fj) 
per  bit.  * 


HPOC  modules  are  being  developed  which  exploit  the  same  optical  principles  as  DOC 
II.  This  new  architecture  integrates  III-V  technology  with  global  (multi-dimensional)  free  space 
“Smart”  optical  interconnects.  These  low  power  modules  are  designed  to  operate  at  speeds  up 
to  1014  operations  per  second.  It  is  anticipated  that  each  module  will  be  less  than  10  cubic 
centimeters  and  will  yield  1  million  optical  interconnects. 


DOC  II 

The  hardware  consists  of  two  primary 
assemblies  (Figures  1  and  2):  the  illumination 
assembly  (Train  A)  and  the  modulation  relay 
assembly  (Train  B).  Both  trains,  which  are 
optimized  for  maximum  throughput  at  837  nm, 
are  positioned  on  a  36”  x  48”  optical  table  which 
is  packaged  in  a  optoelectronic  cabinet 

The  system  architecture  is  based  on  a  N2 
lxl  parallel  interconnect  topology.  At  the 
microcode  level,  each  instruction  can  be  written  as 
a  series  of  parallel  combinatorial  functionals.  Data 
re-use  at  the  control  mask  is  achieved  by  operating 
on  the  data  several  times  within  one  instruction. 
Up  to  128  selections  (logical  functionals)  of  64 
input  signals  can  be  computed  every  10  nsec.  The 
selection  occurs  by  use  of  a  control  mask  which  is 
clocked  into  the  spatial  light  modulator  (64  x  128 
input  control  data  matrix).  Thus,  the  processor 
can  achieve  an  optical  data  carrier  fan-out  of 
1:128  and  a  control  logic  fan-in  of  64:1. 

Current  compute  intensive  applications 
include  RISC  emulation,  full  text  data  searches, 
and  multimedia  database  development. 
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HPOC  Modules 

HPOC  modules  are  being  developed  to  provide  a  high  speed,  low  power  inter-module 
enhancement  to  GaAs  logic  families.  The  architecture  exploits  optical  interconnect  technology 
[1-2],  and  capitalizes  on  the  advantages  of  "Smart"  global  interconnects  which  include  high  fan¬ 
outs  and  fan-ins,  low  power  consumption,  high  algorithmic  efficiency,  and  high  noise  margin. 
Because  of  the  high  fan-out  /fan-in  capability  and  high  clock  rates  (100  MHz+)>  wide  word 
processing  is  realized.  This  is  accomplished  by  increasing  the  number  of  multi-input  gates 
thereby  decreasing  the  number  of  gate  delays. 


Figure  3  is  an  example  of  a  "Smart"  interconnect  where  the  fan-m  is  N  bits  wide.  When 


an  arbitrary  digital  wide  word  is  represented  in  its 
complemented  form,  the  detector  acts  as  an  OR 
gate,  literally  performing  a  Boolean  summation  on 
the  wide-word.  After  electronic  inversion  and 
laser  emission,  the  output  light  represents  the  N- 
bit  AND  product  of  the  input  bits.  Consequently, 
a  single  gate  delay  with  64-bit  AND  gates  is 
possible.  These  structures  may  be  arbitrarily 
expanded  to  any  digital  function  required  such  as 
wide  word  addition,  counting  and  floating  point 
multiplication  structures. 


Figure  3:  Simple  example  of  "Smart"  interconnect  which 
demonstrate!  De Morgan’s  theorem  on  free  space  optical 
interconnects  to  achieve  a  wide  input  AND  function 


Figure  4:  Single  stage  global  free  space  "Smart" 
interconnect  module  utilizing  DANE  switching 
devices  to  form  arbitrary  min  terms 


Each  interconnect  module  consists  of 
an  array  of  2-dimensional  GaAs  DANE  (Figure 
4)  cells  and  a  diffractive  optical  interconnect 
element  (DOIE).  "DANE"  is  an  acronym  that 
refers  to  a  "Smart"  pixel  which  [1]  detects 
light,  [2]  amplifies  the  result,  [3]  negates  the 
result  (inversion)  and  [4]  emits  the  Boolean 
value  through  the  output  laser.  DOIEs 
selectively  image  each  element  on  the  DANE 
array  to  predetermined  positions  on  the  next 
DANE  detector  plane.  The  logical  functions 
performed  by  the  combination  of  DANE  and 
"Smart"  interconnects  are  Shannon’s  minterms 
(functionals)  at  the  photodetector  array  and  the 
summation  of  minterms  or  complete  instructions 
at  the  logical  summation  of  the  laser  array. 


In  Figure  4,  it  is  possible  to  expand  both  the  %  input  arrays  to  two  dimensions  and  the  linear 
DOIE  arrays  to  two  dimensions.  The  DOIE  arrays  are  performing  a  4  dimensional  interconnect 
The  input  is  written  X^,  the  control  mask  is  ILy  and  the  output  may  be  written  tu  where  a  two 
dimensional  array  of  minterms  is  generated.  Each  min  term  generated  in  the  output  array  can 
consist  of  up  to  i*j  Boolean  variables.  Equation  1  summarizes  the  4  dimensional  interconnect 
scheme: 


fu=  £  £  *a  u = n  n 

“  i  ■  1  j  =  1  «  i  a  1  j  a  1  « 


(1) 


HPOC  modules  are  being  designed  to  solve  compute  intensive  algorithms  for  applications  such 
as  data  compression,  data  encryption,  artificial  intelligence,  high  speed  switching,  numerical  analysis 
and  finite  element  analysis.  MCM  communications  applications  are  also  being  pursued  because  the 
global  technology  allows  for  increased  fan-in  and  interconnect  density. 

[1]  J.  W.  Goodman,  F.  Leooberger,  H.  King  &  R.  Atbak,  "Optical  Interconnects  for  VLSI  Systems”,  Ptoc.  IEEE  VoL  72, 1984. 

[2]  y  W.  Goocfcnan,  “Optics  as  an  Interconnect  Technology”  in  H.  H.  Araenault,  T.  SzopBic,  B.  Macukow,  "Cortical  Processing  and 
Computing”,  1989  Academic  Press  Inc.,  Boston. 
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Abstract  -  This  paper  presents  the  physical  design  and  demonstration  of  an  optical  content-addressable  parallel 
processor  (OCAPP).  We  present  the  optical  implementation  of  our  laboratory  prototype  and  discuss  the  experimental 
results  of  a  representative  string  search. 
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Fig  1:  Structural  organization  of  OCAPP. 


Summary  -  The  demand  for  high-speed  parallel  processors,  stimulated  by  applications  such  as  real-time  control  sys¬ 
tems  and  database  processing,  encouraged  researchers  to  seriously  consider  the  use  of  optics  for  solving  the  problems 
experienced  by  conventional  architectures.  Our  original  research  in  this  area  led  to  the  development  of  a  new  architec¬ 
ture  called  Optical  Content-Addressable  Parallel  Processor  (0CAPP)[1].  The  architecture  is  designed  to  exploit  optics 
advantages  fully  in  interconnects  and  high-speed  operations.  This  paper  discusses  the  physical  implementation  of  an 
experimental  prototype  version  of  OCAPP.  that  we  are  constructing  in  our  Optical  Computing  and  Parallel  Processing 
Laboratory  at  the  University  of  Arizona.  We  begin  by  first  presenting  a  brief  overview  of  the  architecture,  which  is 
then  followed  by  the  layout  of  the  optical  system  and  photographs  of  the  system’s  experimental  results. 

Architectural  description  of  OCAPP: 
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Fig  2:  The  optical  system  for  the  experimental  OCAPP. 


Fig.  1  illustrates  the  structural  organization  of  OCAPP.  The  architecture  consists  of  the  selection  unit,  match/ compare 
unit  (MCU),  response  unit,  output  unit,  and  control  unit(not  illustrated).  The  selection  unit  is  an  n  x  m  bit  storage 
array  loaded  in  parallel  from  optical  memory.  Word  and  bit-slice  selection  logic  allow  the  enabting/disabling  of  words 
and/or  bit-slices  as  required.  The  output  of  the  selection  unit  is  processed  by  the  MCU  where  the  parallel  matching  of 
the  storage  array  and  the  lxm  interrogation  register  (I)  occurs.  The  1  register  stores  the  comparand(s)  of  a  search.  The 
processing  capability  of  OCAPP  relies  on  the  results  of  equivalence  and  threshold  comparisons  between  the  I  register 
and  a  multi-element  data  set  known  as  the  2-D  optical  data  array.  An  equivalence  comparison  determines  if  two  words 
are  equal  or  not  equal  while  a  threshold  comparison  determines  the  relative  magnitude  of  an  inequality.  Three  n  x  1 
bit  output  registers;  E,  G,  and  L  result  from  the  MCU  operation.  A  one  in  the  R  register  denotes  the  equivalence  of 
the  1  register  with  the  corresponding  row  of  the  enabled  storage  array.  Accordingly,  ones  in  the  G(L)  register  indicate 
words  of  the  storage  array  that  are  greater(less)  than  I.  Meanwhile,  the  match /detector  bit  signifies  that  at  least  one 
word  of  the  storage  array  matches  I.  Extended  versions  of  the  architecture  make  use  of  the  single-iteration  thresholding 
algorithm  [2]  to  process  threshold  searches  in  constant  time.  Based  upon  the  contents  of  the  R,  G,  and  L  registers,  the 
desired  words  of  the  data  set  are  transferred  to  the  output  array  by  the  output  unit. 

'This  research  was  supported  by  u  NSF  great  No.  MIP  9113688. 
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Optical  Implementation  of  OCAPP:  Our  system  uses  a  mixture  of  electronics  and  optics.  To  begin,  all  of  the  electronics 
are  controlled  by  a  Motorola  M68HC11  microcontroller  board.  The  spatial  light  modulators  (SLMs)  are  active-matrix 
liquid  crystal  televisions  (LCTVs).  Instead  of  using  frame  grabbers  to  write  the  LCTVs,  we  interfaced  video  generation 
ICs  to  the  microcontroller  to  generate  a  composite  video  signal  of  the  pattern.  After  the  patterns  are  written  to  the 
LCTVs,  linear  CCD  arrays  detect  the  result  which  is  then  digitized  by  in-house  circuits. 

A  detailed  illustration  of  the  optical  system  appears  in  Fig.  2.  The  system  is  sourced  by  an  Argon  laser.  The  beam 
passes  through  a  half-wave  plate  that  aligns  the  light  polarization  with  the  liquid  crystals  in  LCTVl.  The  diffuser 
makes  the  intensity  variation  of  the  data  beam  more  uniform  while  also  reducing  the  double-slit  interference  effects 
due  to  the  pixel  apertures.  Lens  LI  collimates  the  source  beam.  The  combination  of  the  first  two  LCTVs  forms  a 
vector-matrix  multiplier.  The  I  register  is  loaded  into  each  row  of  LCTVl  while  the  2-D  optical  data  array  is  loaded 
into  LCTV2.  Multiply  writing  the  LCTVl  pattern  eliminates  the  beam  expansion  optics  necessary  for  vector-matrix 
multiplication.  The  result  is  then  split  into  two  paths.  The  path  reflecting  off  the  beamsplitter  is  spatially  filtered  and 
imaged  onto  CCD1  by  lenses  L2,  L3,  L4  and  CL1.  This  branch  performs  equality  searches  and  reports  its  results  in 
the  R  register  (CCDl).  In  the  other  path,  the  bit-slice  and  word  disabling  functions  of  the  selection  unit  are  provided 
by  LCTV3.  The  result  is  spatially  filtered  and  imaged  onto  CCD2  by  lenses  L5,  L6,  L7,  and  CL  to  form  the  G  register. 
The  L  register  was  not  implemented  because  it  does  not  demonstrate  any  more  functionality  than  the  G  register  does. 
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(»)  (b)  (c)  (d) 

Fig  3:  Physical  demonstration  of  the  parallel  search  capability  of  OCAPP. 

Experimental  Results:  The  data  in  our  initial  experiments  is  encoded  using  a  dual-rail  spatial  encoding  scheme  [3j.  In 
this  scheme,  both  a  binary  value  and  its  complement  represent  a  single  bit,  requiring  two  pixels  per  bit.  The  optical 
matching  operation  is  performed  by  superimposing  the  patterns  written  to  LCTVl  and  LCTV2  with  the  incident  laser 
beam.  Two  input  bits  are  logically  equivalent  if  both  pixels  of  the  result  are  dark.  Illumination  present  in  either  pixel 
of  a  result  bit  indicates  the  mismatch  of  the  two  input  bits.  Two  words  are  equivalent  if  there  is  no  light  in  any  bits 
of  the  result. 

Words  are  mapped  onto  the  SLMs  by  partitioning  the  LCTV  display  areas  into  eight  rows  of  sixteen  squares  to 
demonstrate  an  8  x  8  data  array.  We  load  the  comparand  10101010  into  each  of  the  eight  rows  of  LCTVl.  Meanwhile, 
the  patterns  00000000,  01001010,  10101111,  00101010,  *10101010,  10101011,  00001010,  11111111  are  written  to  rows 
one  through  eight  of  LCTV2.  Notice  that  the  word  in  row  5,  highlighted  by  the  asterisk,  matches  the  comparand  while 
the  words  in  rows  4  and  6  differ  by  only  a  single  bit.  These  three  rows  ultimately  determine  the  system’s  success  since 
the  digital  thresholding  of  the  result  must  be  capable  of  distinguishing  perfectly  matching  entries  from  those  with  at 
least  a  single  mismatch. 

Experimentally-obtained  photographs  of  the  data  planes  are  shown  in  Fig.  3.  The  input  patterns  written  to  LCTVl 
and  LCTV2  are  shown  in  Fig.  3(a)  and  Fig.  3(b),  respectively.  In  Fig.  3(c),  we  report  the  results  of  the  optical  matching 
operation.  We  see  that  row  5  is  completely  dark  because  of  the  perfect  match  between  the  comparand  and  the  fifth 
entry.  A  single  pixel  is  illuminated  in  rows  4  and  6,  indicating  a  single  mismatch  between  these  array  entries  and  the 
comparand.  Furthermore,  the  photograph  in  Fig.3(d)  illustrates  the  output  of  the  horizontal  summing  of  Fig.  3(c). 
Note  that  the  column  vector  is  vertically  inverted  due  to  the  spatial  filtering  operation.  Overall,  the  results  display  a 
high  contrast  level  and  successfully  demonstrate  the  system’s  ability  to  perform  optical  parallel  string  searches.  The 
CCD  detects  the  result  and  reports  it  to  the  microcontroller.  A  new  set  of  patterns  are  generated  and  the  feedback 
is  completed.  Please  note  that  the  architecture  is  not  limited  to  parallel  string  searches.  There  are  a  wide  variety  of 
applications  such  as  knowledgebase  processing,  parallel  sorting,  etc.  that  are  excellent  candidates  for  implementation 
on  OCAPP.  More  details  and  results  will  be  provided  at  the  Meeting. 
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Abstract:  An  optoelectronic  database  filter  demonstration  using  arrays  of  phototransistors 
and  surface  «whring  lasers  connected  as  AND  and  XOR  gates  will  be  presented. 


Introduction: 

Optoelectronics  can  have  an  advantage  over 
electronics  in  database  operations  if  die  parallel¬ 
ism  of  optics  is  employed.  Utilizing  massively 
parallel  optical  signals  from  memories  such  as 
parallel  read-out  optical  disks,  volume  holo¬ 
grams,  and  3-D  two-photon  memories,  large  sets 
of  records  in  a  relation  can  be  simultaneously 
compared  against  a  search  argument  [1].  In  the 
past  much  effort  has  been  spent  in  developing 
either  devices  for  optoelectronic  systems  or 
systems  for  optoelectronic  devices  but  very  little 
effort  on  integration  of  die  devices  with  the 
systems.  The  database  filter  affords  die  opportu¬ 
nity  to  create  a  system  with  modern  devices 
which  can  utilize  massive  parallelism  in  an 
application  where  electronics  cannot  realistically 
compete.  Previously  [1,2]  we  have  reported 
optoelectronic  architectures  that  function  as  a 
data  base  filter.  Figure  1  shows  the  current 
architecture,  which  requires  2-D  arrays  of  AND 
and  XOR  logic  gates.  We  have  shown  that  these 
smart  pixels  can  be  realized  with  a  combination 
of  heterojuncdon  phototransistors  (HPTs)  which 
serve  as  photodetectors,  amplifiers,  and  logic 
switches;  and  vertical  cavity  surface  emitting 
lasers  (VCSELs)  which  are  efficient  optical 
output  devices  dun  have  a  low  divergence  angle. 
This  paper  describes  the  implementation  of  a 
database  filter  types  of  devices. 


optical  signal  flow  [1]. 

In  die  initial  implementation,  the  logic  gate 
arrays  will  utilize  a  hybrid  interconnection  of 
HPTs  and  VCSELs,  each  in  separate  packages. 
The  AND  and  XOR  gate  arrays  have  been 
demonstrated  as  single  pixels  [3]. Optical  gains  of 
these  hybrid  devices  have  been  measured  to  be 
approximately  3  and  are  expected  to  increase  to 
over  5  in  the  near  future  [3],  which  is  sufficient 
to  overcome  the  losses  introduced  by  the  beam 
splitters  between  successive  stages  of  the  filter. 
VCSEL  arrays  (Figure  2)  have  been  fabricated' 
and  the  mask  set  for  the  AND  and  XOR  arrays 
have  been  designed  and  fabricated.  The  layout 
designs  for  the  XOR  arrays  (Figure  3)  required 
special  consideration  of  the  interconnects  and 
traces  for  the  bonding  pads  in  order  to  minimize 
array  area  while  maximizing  the  input  window 
area  of  the  phototransistors.  This  increases  the 
gain  efficiency  of  the  detector  and  allows  easy 
optical  coupling  of  the  input  signals.  An  impor¬ 
tant  characteristic  of  the  XOR  configuration  is 
the  fact  that  the  series  connected  HPTs  must  be 
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able  to  conduct  twice  as  much  current  as  the 
parallel  HPT*  when  both  inputs  are  "ON*  (logic 
1),  which  enables  the  current  through  die 
VCSEL  to  fall  below  1*  and  die  output  to  be 
"OFF"  (logic  0)  [3].  This  has  been  accomplished 
by  reducing  die  input  window  of  the  parallel 
connected  HPTs  with  an  opaque  metal  mask. 
This  method  requires  that  the  input  spot  be  not 
smaller  the  input  window.  The  HPT  arrays 
are  being  fabricated  from  InGaP/GaAs  layers 
grown  by  gas  source  molecular  beam  epitaxy  as 
described  previously  [4].  This  choice  of  material 
system  allows  for  dependable  processing  which 
will  improve  device  uniformity  across  the  wafer, 
as  well  as  provide  cascadability.  The  VCSELs 
were  fabricated  from  AlGaAs/GaAs  layers  grown 
with  MBE.  The  hybrid  arrays  will  be  packaged 
in  DIP  packages  and  optics  will  be  used  to  direct 
and  focus  die  optical  signals.  The  system  will  be 
constructed  on  an  optical  table  complete  with  an 
interface  to  an  electronic  computer  to  monitor 
operation  and  aid  in  obtaining  results. 


Figure  2:  Photograph  of  8  X  8  VCSEL  array  for 
hybrid  amait  pixel* 


Figure  4:  Layout  of  mask  for  AND  array 
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Abstract 

Fault  tolerance  can  be  incorporated  in  digital  optical  computers  using  a  distributed  redun¬ 
dancy  technique  called  quadding,  at  an  expense  of  quadruplicating  the  hardware,  and  fan-in  and 
fan-out  increasing  to  4. 

A  key  requirement  in  a  practical  digital  optical  computing  scheme  is  the  ability  to  recover 
from  transient  and  permanent  device  faults  and  signal  errors.  This  is  especially  critical  when 
using  emerging  technologies  such  as  optical  switch  arrays  which  are  bound  to  have  large  numbers 
of  device  errors,  and  for  optically  interconnected  systems  which  may  suffer  transient  errors  such 
as  that  due  to  dust  particles  floating  through  the  optical  beams.  Redundancy  is  the  most 
common  technique  utilized  to  endow  a  system  with  a  limited  degree  of  fault  tolerance  and 
increase  the  system  reliability  beyond  that  given  by  the  product  of  the  probabilities  of  correct 
operation  of  the  components.  However,  in  many  redundant  systems,  a  voter  is  required  to 
resolve  conflicts  between  the  redundant  components,  but  a  fault  in  the  voter  still  produces 
erroneous  outputs.  Although  multiply  redundant  voters  can  be  incorporated,  a  mechanism 
must  be  included  that  eliminates  and  replaces  faulty  elements  from  the  circuits,  or  else  errors 
can  propagate.  These  techniques  can  become  quite  complex  and  may  be  inappropriate  for 
optical  implementation.  Another  approach  is  to  distribute  the  voter  throughout  the  circuit 
using  the  technique  of  quadded  logic. [1-3)  In  quadded  logic,  4  copies  of  the  circuit  are  produced, 
then  interconnected  in  a  permuted  fashion  that  allows  isolated  errors  within  a  quadded  block 
of  elements  to  be  detected  and  corrected  within  the  next  few  layers.  This  is  often  considered 
an  expensive  approach  to  fault  tolerance,  since  it  multiplies  the  hardware  by  a  factor  of  4,  and 
doubles  the  fan-out  and  fan-in  of  the  elements.  However,  the  optical  implementation  of  quadded 
logic  in  a  regularly  interconnected  system  has  some  attractive  features  as  shown  in  Figures  1 
and  2.  The  depth  of  the  circuit  is  not  increased,  just  the  width,  so  no  additional  delay  or 
speed  penalties  are  imposed.  The  interconnections  between  the  quadded  circuits  are  reasonably 
regular  and  may  be  amenable  to  optical  implementations.  These  interconnection  topologies 
are  reminiscent  of  optical  crossovers,  and  might  be  implemented  with  a  similar  technique.  One 
possibility  is  to  interleave  the  original  circuits  on  rows  separated  by  4,  and  interleave  the  quadded 
duplicates  at  the  intervening  positions.  The  same  basic  architecture  of  shuffles  or  crossovers 
within  the  rows  can  be  performed,  and  holographic  interconnections  within  a  quadded  set  of  4 
rows  might  not  overly  increase  the  system  complexity. 

The  operation  of  a  quadded  system  based  on  a  regular  logical  structure  is  illustrated  in 
Figure  1.  Device  errors  are  indicated  as  signals  in  boldface  type.  For  example  x  —  1  should 
be  represented  at  the  input  with  four  l’s,  however  the  last  input  is  in  error.  The  quadded 
interconnects  in  the  first  layer  mix  the  signals  from  the  first  two  and  last  two  redundant  inputs, 
represented  as  (12,34).  This  corrects  the  NOR  gate  subcritical  error  1  — ►  0  in  one  step.  On 
the  other  hand,  the  input  error  for  x  is  a  critical  0  — ►  1  NOR  error,  which  is  converted  by  the 
first  layer  quadded  interconnect  to  two  subcritical  errors,  shown  in  italics.  As  long  as  the  next 
layer  of  quadded  wiring  mixes  the  correct  and  erroneous  signals,  these  subcritical  errors  will  be 
corrected  in  the  next  layer,  and  this  is  accomplished  by  mixing  the  odd  redundant  signals  and 
the  even  redundant  signals,  represented  as  (13,24).  Similarly,  errors  introduced  throughout  the 
logical  structure  can  be  corrected,  unless  they  appear  too  close  to  another  error.  As  a  result 
error  free  outputs  are  produced. 
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The  regular,  layered  structure  of  the  conventional  digital  optical  computer  interconnects 
is  eminently  suited  for  quadding  without  the  complexities  encountered  in  random  logic.  In 
addition,  the  quadded  interconnects  take  advantage  of  the  capabilities  of  the  optical  intercon¬ 
nects  to  accommodate  complex  wiring  topologies.  The  isomorphism  between  the  (12,34)  and 
( 14,23)  quadding  patterns  with  crossover  interconnects  should  allow  efficient  implementation  of 
quadding  in  crossover  networks,  and  in  these  regular  structures  only  two  alternating  quadded 
patterns  are  needed.  Alternatively,  by  placing  the  redundant  quadded  logic  devices  in  a  sparse 
topology  then  a  shift-invariant  interconnection  can  be  utilized.  The  example  shown  in  Figure 
3  shows  that  the  (12,34)  and  (13,42)  quadding  patterns  can  be  implemented  with  a  fanout  of 
3,  light  efficiency  of  66%  and  device  packing  densities  of  66%  and  50%  respectively,  while  the 
quadding  pattern  (14,23)  requires  a  fanout  of  5,  and  achieves  a  light  efficiency  of  only  40%  and  a 
packing  density  of  only  44%.  Quadding  as  2  by  2  blocks  allows  2-D  shift  invariant  interconnec¬ 
tions  with  44-50%  packing  density.  For  these  shift  invariant  topologies  the  (12,34)  and  (13,42) 
quadding  patterns  should  be  alternated. 

This  approach  to  redundant  fault  tolerant  digital  optical  computing  may  allow  the  utilization 
of  devices  with  increased  probabilities  of  failure  without  an  unacceptably  large  system  reliability 
penalty.  Such  an  approach  may  be  required  in  order  to  make  practical  and  reliable  digital  optical 
computers  out  of  simp’  arrays  of  unreliable  switching  elements. 
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Figure  1:  Error  correction  operation  of  a  quadded  regularly 
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Figure  2:  One  possible  3-D  topology  of  quadded  regular  in-  Figure  3:  Sparse  device  layout 
terconnections.  for  shift  invariant  quadding. 
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Abstract: 

Considerations  on  number  representation  are  introduced  to 
make  use  of  the  advantages  of  optics  in  computer  architectures. 
Specific  optical  systems  are  proposed  for  CORDIC  algorithms  and 
for  discrete  transforms. 


The  recent  developments  of  bistable  or  fast  switching  optical  components  (S-SEED, 
combination  of  laser  diodes  and  photodiodes)  do  not  alone  imply  that  optics  could  be 
an  alternative  technology  for  high  speed  parallel  computers. 

Generally  the  need  for  a  duality  between  algorithms  and  optical  implementations 
is  not  considered.  In  this  paper  we  focus  on  number  representations  and  give  some 
examples  of  optical  processing  architectures.  We  distinguish  two  well  known  number 
representations:  the  position  number  representation  and  the  residue  number  represen¬ 
tation. 

One  of  the  most  widespread  position  number  representations  is  the  binary  number 
representation  used  in  electronic  digital  computers.  This  number  representation  is 
imperfectectely  suited  to  optical  implementation  because  of  the  sequential  nature 
and  of  the  limited  spatial  representation  which  reduce  the  advantages  of  optical 
implementations.  The  signed  digit  number  representation,  introduced  by  Avizienis  [1], 
is  a  possible  alternative.  This  representation  permits  carry  free  parallel  processing 
with  a  number  of  steps  independent  of  the  length  of  the  number  representation.  If 
pattern  recognition  method  is  left  aside,  it  is  then  possible  to  implement  efficient 
algorithms  based  on  elementary  operations.  CORDIC  algorithms  [4]  are  initially 
developped  for  binary  number  representation  and  need  only  additions  and  shifts.  We 
demonstrate  that  CORDIC  algorithms  were  also  adaptable  to  modified  signed  digit 
number  representation.  We  then  propose  an  optical  implementation  which  takes 
advantage  of  an  incoherent  illumination,  spatial  optical  number  encoding,  parallel 
processing  and  low  cost  devices.  A  25  MHz  operating  frequency  is  expected  for  an 
all  optical  comparator  of  modified  signed  digit  numbers. 

CORDIC-like  algorithms  could  be  written  for  higher  radices  which  allows  a  more 
suitable  spatial  encoding  for  an  optical  processor.  The  position  number  representations 
are  not  optimal  because  they  do  not  take  full  advantage  of  the  parallelism  of  optics  and 
optically  addressed  component  arrays. 

The  residue  number  system  [2]  reduces  the  operation  complexity  by  dividing  the 
number  representation  into  small  independent  integers.  In  this  way,  the  quadratic 
residue  number  system  allows  computations  on  complex  numbers.  The  spatial  positions 
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of  spots  of  light  could  be  a  possible  optical  representation.  An  operation  is  then 
easily  obtained  in  one  step  by  looking  up  the  result  in  an  optical  memory.  This 
addre&oing  method  permits  parallel  processing  for  till  residues.  We  implement  this 
number  representation  in  butterfly  structure  for  discrete  transform  computations  [3]. 
The  regular  aspect  of  optical  interconnections  in  butterfly  structures  is  the  most 
attractive  property  of  a  passive  and  parallel  optical  network.  Each  node  of  this  network 
is  made  of  an  optical  lookup  table  structure  which  performs  computations  on  complex 
numbers. 

For  each  mathematical  number  representation,  specific  algorithms  are  developped 
and  benefit  from  the  properties  of  optics.  FYom  our  point  of  view,  it  is  advisable  to 
use  only  optical  codings  which  are  based  on  the  binary  intensity  modulation  of  light. 
In  this  way,  optical  implementation  could  bring,  for  example,  solutions  to  problems  of 
interconnection  in  electronic  parallel  computers 

1  Avizienis,A.,  Signed-digit  Number  representations  for  fast  parallel  arithmetic. 
IRE  Transactions  on  electronic  computers ,  EC-10  (Septembre  1961),  389-400 

2  Huang,  A.,  Tsunoda,  Y.,  Goodman,  J.W.,  Ishihara,  S.,  Optical  computation  using 
residue  arithmetic  Applied,  Optics ,  18,  2  (15  Janvier  1979),  149-162. 

3  Truong,  T.K.,  Chang, J.J.,  Hsu,I.S.,  Pei, D.Y., Reed, I.S.,  Techniques  for  computing 
the  discrete  Fourier  transform  using  the  quadratic  residue  fermat  number  systems 
IEEE  Transactions  on  Computers,  C-35,  11  (Novembre  1986),  1008-1012. 

4  Voider,  J.E.  The  CORDIC  Trigonometric  Computing  Technique.  IRE  Transac¬ 
tions  on  electronic  computers,  EC-8,  3  (Septembre  1959),  330-334. 
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Design  of  symbolic  substitution  systems  for  micro  integration 

C.  Passon,  K.-H.  Brenner  and  W.  Eckert,  University  Erlangen-NQrnberg,  Physikalisches  Institut, 

Angewandte  Optik,  Staudtstrafie  7/B2, 91058  Erlangen,  Germany, 

Tel.  +49  9131  858377,  Fax.  +49  9131  13508,  e-mail:  chria@ao.physik.uni-erlangen.de 

Abstract:  The  design  of  micro  optical  systems  has  to  consider  features  of  the  micro  optical  components.  We 
introduce  design  concepts  for  micro  optical  systems  exemplified  at  a  system  for  symbolic  substitution. 

Introduction  Systolic  arrays  represent  a  concept  for  developing  highly  parallel  computer  systems  using 
regularly  interconnected  simple  processor  arrays  [1].  We  have  recently  demonstrated  that  systolic  arrays  can  be 
easily  mapped  to  symbolic  substitution  rules  [2]  and  thus  can  be  implemented  optically.  We  have  constructed  an 
optical  pipeline  adder  based  on  systolic  arrays,  which  was  realised  with  macroscopic  optical  components. 

This  adder  consisted  of  an  array  of  8x8  half  adders,  performing  a  foil  addition  of  8-bit,  dual  rail  coded  numbers 
in  a  pipeline  within  8  iterations.  The  active  array  consisted  of  16x16  pixels.  From  space-bandwidth 
considerations  one  can  derive  that  the  imaging  of  an  such  an  array  requires  only  lens-diameters  of  a  few 
hundred  microns.  Consequently  the  size  of  the  whole  system  can  be  reduced  into  the  submillimeter  range  using 
micro  optical  components.  A  concept  for  such  a  miniaturising  of  free-space  optical  systems  was  recently 
presented  [3,4].  With  this  stacked  approach,  the  packing  density  and  the  connectivity  of  three-dimensional 
optical  systems  can  be  utilised  better  that  with  planar  integrated  optics. 

Design  Concepts:  Here  we  try  to  build  on  this 
integration  concept  in  order  to  conceptually  first 
realise  a  miniaturised  version  of  the  optical  pipeline 
adder  and  then  to  generalise  this  architecture  to 
implement  general  systolic  array  architectures.  The 
concept  takes  the  features  of  micro  optical 
components  into  account  and  is  thus  compatible  with 
fabrication  constraints 


Input 

Fig.  1:  Multiple  imaging  system 


Output 


The  optical  pipeline  adder  consists  of  a  recognition 

and  a  substitution  part.  Each  part  is  realised  by  a  sequence  of  two  multiple  imaging  systems  (MIS).  Active 
components  (NOR-  and  OR-gatc  arrays)  are  located  at  the  exit  of  each  part  Each  MIS  (fig.  1)  consists  of  two 
Fourier  transform  stages,  which  are  constructed  as  light  pipes,  because  this  configuration  offers  best  light 
efficiency  and  resolution  [5], 

The  complete  sequence  of  components,  needed  to  realise  a  substitution  system  is  shown  in  fig.  2.  This  stretched 
out  version  could  in  principle  be  realised  by  a  static  of  optical  components 
There  are,  however,  a  series  of  issues  that  have  to  be  addressed 


Fig.  2:  Complete  symbolic  substitution  stage 

First,  the  number  of  layers,  to  be  stacked,  is  very  large  in  this  version.  This  may  cause  problems  in  aligning  the 
system. 

Second,  the  active  device  in  the  centre  of  this  sequence  would  have  to  be  realised  in  transmission  mode,  which 
is  undesirable  both  for  thermal  and  for  connectivity  reasons.  A  more  realistic  design  should  {dace  the  active 
components  at  the  ends  of  the  stack.  Thus  a  folding  of  the  system  is  necessary. 


The  first  version  of  Added  systems,  shown  in  Fig.  3  is  a  straight  forward  step  from  the  multilayer  approach  to 
fewer  layers,  since  the  deflecting  minors  perform  both  functions,  the  folding  of  the  system  and  splitting  and 
shifting  of  data  planes.  The  space  between  the  light  pipes,  needed  for  the  deflecting  mirrors,  requires  a  pupil 
between  the  mirrors  to  prevent  vignetting.  This  is  not  compatible  with  a  layered  structure,  since  the  pupil  would 
have  to  be  oriented  perpendicular  to  the  substrate  surface,  in  order  to  be  located  in  the  centre  between  the 
mirrors. 
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Another  limitation  of  this  configuration  is  that  the  location  a  of  the 
pupil  has  to  satisfy  the  condition  D  £  a  £  ffi.  The  largest  numeric 
aperture  (N.A.)  is  achieved  for  minimal  a.  Thus  the  best 
configuration  D  ■  a  *  f/2  results  in  a  maximum  N.A.  of  0.5. 

The  light  efficiency  is  determined  by  the  diameter  of  the  pupil,  which 
is  given  by  Wp(a)  “  D(l-afi). 

In  our  second  approach  (fig.  4)  we  insert  the  mirrors  into  the  light 
pipes.  Here  all  components  are  arranged  in  separate  layers  and  no 
vertically  oriented  components  are  necessary.  In  this  approach,  we 
move  the  image  plane  away  from  the  input  data  plane.  The  N.A.  is 
also  limited  by  the  increased  distance  resulting  from  the  double 
mirror  reflection.  The  minimum  distance  between  lenses  is  2D, 
resulting  in  a  maximum  N.A.  of  0.5.  Here  beam  splitters  are 
necessary,  which  can  be  implemented  by  the  LIGA  technique  [6]. 


Input 


Interlace,  Deinterlace  Stage 
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Substitution  Stage 


Fig.  4:  Full  folded  substitution,  with  active  components  at  the  ends  of  the  layer  structure 

In  a  next  step  we  can  exploit  the  fact  that  the  required  shifts  in  two  successive  stages  are  identical.  Thus  the 
same  hardware  can  be  used  in  the  forward  and  in  the  backward  direction,  if  a  reflective  array  is  placed  between 
these  stages  (fig.  4,  right  side).  This  scheme  can  be  replicated  infinitely  above  and  below.  Thus  the  initially 
stretched  out  system  can  be  arbitrarily  cascaded  to  achieve  complex  systems  and  also  make  full  use  of  the 
available  substrate  area. 


[1]  W.  Erhard,  D.  Fey  Parallele  digitate  optische  Recheneinheiten ,  B.G.Teubner,  Stuttgart,  1994 

[2]  D.  Fey,  K..-K  Brenner  Digital  optical  arithmetic  based  on  Systolic  Arrays  and  Symbolic  Substitution  Logic, 
Optical  Computing  1, 153*167, 1990 

[3]  K.-R  Brenner,  Techniques  for  integration  of  3D-oprical  systems,  SPIE  Proc.  Vol  1544,  Miniature  and 
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Applications  of  LIGA  components  in  three-dimensional  microoptics,  Appl.  Optics,  32  No.  32, 6464-6469, 1993 
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Abstract 

Differential  pain  of  pnpn  photothyristors,  which  behave  as  comparators  with  optical  inputs  and  outputs,  can  be 
used  with  laser  speckle  to  implement  sigmoid  updating  probability  processors  and  thus  used  in  stochastic 
information  processing. 

Introduction 

In  optimisation  techniques,  the  simulated  annealing  algorithm  is  widely  used.  It  is  based  on  th  iterative 
procedure  of  an  elementary  stochastic  operation  which  provides  acceptance  or  rejection  with  moid 

updating  probability 

P{F)  =  l/[l+exp(-F/r)]  (1) 

where  the  parameter  T  is  called  the  temperature  and  should  decrease  slowly  over  time  (annealing)  to  reach 
energy  minimisation.  F  is  called  the  force  and  is  related  to  energy  gradient.  We  will  show  that,  when 
combined  with  speckle  illumination,  a  simple  processing  element  (namely  a  pnpn  photothyrist  jr)  can  perform 
this  stochastic  operation 

Optoelectronic  elementary  processors  arrays  are  essential  to  the  development  of  reliable  high-speed  parallt, 
stochastic  processing  units.  Many  different  optoelectronic  devices  could  conceivably  be  used  in  the  elaboration 
of  new  algorithms:  SEEDs,  optically  activated  VCSEL,  etc.  Pnpn  photothyristors  are  particularly  well-suited 
for  these  applications.  These  devices,  first  introduced  by  Jacques  Pankove1  and  his  co-workers,  associate  rapid 
and  easy  operation  with  high  sensibility  by  combining  light  detection  and  emission  in  a  single  unit.  The  basic 
pnpn  photothyristor  acts  as  a  simple  light  activated  diode.  This  GaAs  device  has  two  stable  operating  points: 
when  OFF,  the  impedance  is  high  and  no  light  is  emitted,  and  when  ON  the  photothyristor  conducts  and  emits 
light.  The  pnpn  photothyristors  are  most  useful  when  two  of  them  are  combined  in  a  differential  pair3.  This 
mode  of  operation  is  obtained  by  connecting  two  photothyristors  in  parallel.  The  voltage  source  applied  to  the 
pair  is  a  three-step  voltage  sequence  corresponding  to  reset,  light  detection  and  light  emission.  The  differential 
pair  of  photothyristors  can  be  viewed  as  a  simple  comparator,  the  photothyristor  which  has  detected  the  highest 
intensity  turns  on. 

Differential  detection  of  random  speckle 

Laser  speckle,  as  a  means  of  generating  high  volumes  of  random  numbers  for  parallel  processing,  offers  great 
flexibility  and  easy  implementation3.  By  differential  detection  of  two  speckle  intensities,  a  random  signal  of 
zero  mean  is  created  and  thus  an  easier  implementation  of  a  symmetric  output  probability  function  is  allowed. 

We  illuminate  a  differential  pair  of  pnpn  photothyristors  with  an  homogeneous  random  speckle  pattern.  As 
usual,  the  photothyristor  detecting  the  most  intense  speckle  will  switch  on.  We  can  calculate  this  switch-on 
probability  by  relating  it  to  the  speckle  probability  function.  The  speckle  intensity  I  on  a  detector  can  be 
statistically  described  by  a  gamma  probability  function  where  the  only  parameters  are  the  mean  intensity  of  the 
speckle  field  and  the  number  of  degrees  of  freedom  of  the  detected  speckle4.  To  simplify  the  analysis  in  this 
paper,  we  will  reduce  the  gamma  probability  functions  to  gaussian  ones.  This  approximation  is  valid  when  the 
number  of  degrees  of  freedom  is  large  -  in  practice  greater  than  10.  The  switch-on  probability  is  simply  the 
probability  that  the  first  incident  speckle  intensity  Ix  is  larger  than  the  second  one  I2.  We  also  implement  the 
force  Fo f  Eq.  1  by  illuminating  the  first  photothyristor  of  the  pair  with  an  additional  laser  beam.  The  resulting 
switch-on  probability  P(F)  of  the  first  photothyristor  is  given  by 

P(F)  =  p{li+F>lJ)  =  p(lJ-li<F)  (2) 

The  random  variable  (/2-/,)  is  the  difference  of  two  known  gaussian  random  variables  having  the  same 
statistical  parameters.  The  probability  distribution  of  the  new  random  variable  will  be  a  gaussian  probability 
distribution  of  zero  mean  and  double  variance.  Therefore,  the  probability  that  the  pnpn  where  the  additional 
force  Fis  applied  will  switch  on  is 
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which  is  simply  an  Erf  function.  This  Erf  function  is  equal  to  the  sigmoid  of  Eq.  1,  with  a  high  degree  of 
accuracy,  provided  that 


exp£-M(x/2  <  I  >)2  jdx,  (3) 


(4) 


This  temperature  is  found  by  fitting  the  slopes  of  the  sigmoid  and  of  the  Erf  function  for  F-O.  Consequently,  a 
differential  pair  of  photothyristors  illuminated  with  a  speckle  pattern  should  implement  the  stochastic  updating 
operation  of  Eq.  1. 


Switch-on  probability 
,1.0 


-Differential  pair 
understudy 


Fig.  1 


Experimental  results 

The  first  tests  of  the  differential  operation  described 
above  were  made  with  the  experimental  set-up  shown 
on  Fig.  1.  The  two  photothyristors  of  the  differential 
pair  used  were  identical  squares  of  30x50 
separated  by  IS  pm.  Two  laser  illuminations  were 
used.  First,  a  speckle  pattern  created  by  the  modal 
noise  of  a  step  index  fibre  was  used  to  randomly 
illuminate  the  pnpn  integrated  circuits.  The 
successive  generation  of  time  independent  speckles 
was  ensured  by  a  rotating  diffuser  inserted  between 
the  laser  source  and  the  fibre  input  Then,  a  laser 
diode  was  imaged  onto  one  of  the  photothyristor  of  the 
pair,  thus  acting  as  the  force  on  one  of  the 
photothyristor. 

Light  emission  was  recorded  by  imaging  onto  a 
photodiode  the  photothyristor  which  was  not 
illuminated  by  the  force.  We  synchronised  the 
detection  with  the  voltage  alimentation  of  the  pair. 
The  data  points  are  computed  switch-on  probabilities, 
each  corresponding  to  the  average  of  5000  intensity 
measurements.  Experimental  results  are  shown  on 
Fig.  2.  Three  sigmoid  updating  probability  curves, 
corresponding  to  three  different  temperatures,  are 
shown.  According  to  Eq.  4,  these  three  curves  were 
obtained  with  three  different  values  of  the  mean 
speckle  intensity  (I).  Since  the  temperature 
determines  the  amount  of  randomness  in  the  updating 
operation,  large  values  of  T  correspond  to  large  values  of  (/).  To  show  the  obvious  sigmoid  characteristic  of  the 
response  curves,  the  sigmoid  updating  probabilities  of  Eq.  1  were  fitted  to  the  experimental  data  points.  Note 
that  all  the  sigmoid  curves  are  shifted  to  the  right  This  shift  corresponds  to  the  smallest  energy  required  for  the 
correct  switch-on  operation  to  take  place  and  was  caused  by  intrinsic  asymmetry  of  the  pair.  It  corresponds  to 
an  energy  of  about  5  pj. 


Fig.  2 


Force  [pj] 


Processor  architecture  using  this  stochastic  operation  for  low  level  image  processing  with  arrays  of  pnpns  will 
be  discussed  at  the  conference. 


*J.  L  Pankove  et  al.  A  pnpn  optical  switch,  SPIE  vol.  963,  p.  191,  Optical  Computing  88, 1988. 

2P.  Iferemam,  M.  Kuijk,  R.  Vounckx,  G.  Borghs,  Fast  and  sensitive  two-terminal  double-heterojunction 
optical  thyristors.  Microelectronic  Eng.,  19,  p.49, 1992. 

5P.  Lalanae  et  al.,  2-D  generation  of  random  numbers  by  multimode  fiber  speckle  for  silicon  arrays  of 
processing  elements.  Opt  Comm.,  76,  p.  387, 1990. 

4  J.  W.  Goodman,  "Statistical  Properties  of  Laser  Speckle  Patterns",  in  Laser  Speckle  and  related  phenomena, 
J.C.  Dainty,  ed„  Chap.  2  (1975). 
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Abstract 

Massively  parallel  processing  (MPP)  requires  interconnection  schemes  for  switching  entire 
computer  words  in  parallel.  A  wide  format  switch,  based  on  free- space  global  interconnects 
and  optoelectronic  ExOR  gates  and  capable  of  switching  computer  words  in  parallel  under 
optical  control  is  proposed.  Distributed  and  centralized  routing  control  is  discussed  and 
hardware  realization  is  proposed. 


Summary 


L  Objective 

Massively  parallel  processing  requires  highly  parallel  communications,  and  such 
communications  has  become  a  botdeneck  with  conventional  electronic  implementations  because  of 
the  technological  limitations  of  electrical  interconnection  in  terms  of  area,  latency,  and  power 
dissipation  [1,2].  There  are  many  efforts  in  developing  free-space  interconnection  systems  [3].  As 
a  rule,  these  efforts  are  best  suited  to  telecommumcarion  applications  but  not  to  the  computer  area. 
In  computer  applications  entire  computer  words  consisting  of  64-128  bits  should  be  switched  in 
time  simultaneously.  This  requires  the  design  of  a  wide-format-switch  (WFS)  with  the  minimal 
number  of  switching  and  processing  elements  to  achieve  minimal  latency  and  die  highest  possible 
to  rate  transmission. 

n.  Approach  to  the  Solution  of  the  Problem 

The  basic  building  block  for  multi-stage  interconnection  networks  (MIN)  is  the  2x2 
crossbar  switch.  If  A  and  B  are  inputs,  the  outputs  of  a  cross-bar  switch  are:  D  *  AC  +BC  and 
E  *  BC  +AC,  where  C  is  the  control  signal.  If  C  ■  1,  the  outputs  are  D  *  A  and  E  =  B  (bar 
state),  but  if  C  ■  0,  then  D  »  B  and  E  -  A  (cross  state).  There  are  many  ways  to  implement  the 
cross-bar  switch  function  through  complete  logic  function  sets.  For  example,  a  wide-format- 
switch  (WFS)  for  cross-bar  switching  could  be  implemented  through  an  optoelectronic  ExOR  gate 
array  and  free  space  interconnects.  An  8x2  WFS  implementation  for  two  8  to  computer  words  is 
shown  in  Fig.1.  The  interconnection  pattern  for  two  digits  is  shown  only.  For  an  even  number  of 
MIN  stages,  input  and  output  words  will  be  in  the  same  representation  system  -  no  additional 
complementing  will  be  needed.  Notice  that  the  control  signal  C  is  applied  to  all  of  the  bits  in  two 
computer  words,  thus  achieving  switching  of  the  entire  words  simultaneously  and  in  parallel. 
Major  advantageous  of  optics,  such  as  -  .lgnificantly  larger  fan-out  than  in  electronics  and  less 
power  for  long,  off-chip  runs,  are  used  in  this  design. 

in.  Control  Issues 

A  Multistage  Interconnection  Network  is  a  practical  compromise  to  create  an 
interconnection  mechanism  that  allows  each  processor  to  connect  to  only  one  other  processor  at  a 
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time,  but  this  interconnection  pattern  should  be  dynamically  reconfigurable.  The  routing  control 
for  multistage  networks  can  be  centralized  or  distributed  [2],  and  both  of  these  techniques  could  be 
implemented  in  a  MIN  with  WFSs.  In  distributed  control  the  destination  address  propagates  in 
parallel  to  the  message,  thus  eliminating  latency  associated  with  address  decoding.  At  each  step 
only  part  of  the  destination  address  is  decoded.  In  centralized  control,  paths  are  established  via  a 
controller  which  knows  "a  priori”  the  most  efficient  switch  setting  for  a  given  permutation.  An 
optoelectronic  controller  that  could  be  used  for  the  centralized  control  of  a  MIN  has  already  been 
proposed  by  us  [4].  In  the  holographic  ROM,  which  serves  as  the  interconnect  path  memory, 
routing  paths  are  pre-recorded,  and  the  desired  paths  are  selected  by  the  input  OP  Code.  There  is 
an  internal  feed  back  loop  in  the  controller  which  gives  rise  to  the  possibility  of  reconfiguring  the 
path  routing  automatically  in  accordance  with  the  programs  stored  in  the  holographic  memory  and 
in  the  MPP  executing  routine. 
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As  far  as  practical  implementation  is  concerned,  the  proposal  for  hardware  realization, 
based  on  an  Ex  OR  smart  pixel  array  and  holographic  interconnects  will  be  discussed.  The 
important  issue  is  mean-time-bctween-tailure  of  the  newly  proposed  interconnecting  scheme. 
Factors  affecting  die  BER  in  this  free- space  interconnected  scheme  will  be  considered. 

Support  for  this  work  was  provided  by  the  University  of  Colorado  Optoelectronic 
Computing  Systems  Center. 
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Abstract 

A  shortest  path  hunt  algorithm  is  simulated  for  the  Optical  Cellular  Logic  Image  Proces¬ 
sor.  Its  efficiency  is  compared  with  its  serial  implementation. 

In  1961,  Lee  developed  a  maze  algorithm  capable  of  finding  the  shortest  connections  between 
two  given  points  within  a  rectangular  grid  filled  with  obstacles  [1].  For  this  algorithm,  the  near¬ 
est  neighbours  of  the  starting  point  are  marked.  The  labelling  is  repeated  in  increasing  order  to 
the  next  nearest  neighours  until  the  finish  point  is  reached.  The  shortest  path  is  then  found  by 
tracing  back  the  increasing  sequence  of  the  labels  generated  by  the  expansion  process  (figure  la). 

The  shortest  path  hunt  algorithm  was  later  derived  for  single  instruction  multiple  data  (SIMD) 
computers  within  the  logic  image  algebra  formalism  [2,  3].  The  optical  implementation  necessi¬ 
tates  however,  at  certain  stages,  a  fan-out  corresponding  to  the  number  of  elements  of  the  grid. 
The  new  version  presented  here  utilizes  a  four-nearest  neighbour  interconnect  such  as  in  the 
optical  cellular  logic  image  processor  (O-CLIP)  [4].  The  O-CLIP  implementation  of  this  algo¬ 
rithm  has  been  simulated  by  a  distributed  array  processor  (DAP).  Results  of  this  simulation  and 
benchmark  to  the  serial  Lee  routine  version  will  be  presented. 

A  schematic  of  the  O-CLIP  architecture  adapted  to  the  algorithm  is  shown  in  figure  2.  All 
the  information  as  to  the  labelling  of  the  elements  can  be  held  on  just  three  label  planes,  which 
greatly  eases  the  programming  of  the  O-CLIP.  Three  different  labels  are  in  fact  the  minimum 
number  needed  to  trace  the  shortest  path  from  the  finish  point  back  to  the  starting  point  (figure 
lb). 
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Abstract  In  this  paper  the  architecture  of  a  3D  O-E  hybrid  multiprocessor  is  proposed.  And 
the  O-E  hybrid  interconnection  network  is  used  for  this  system. 

1.  The  architecture  of  the  optoelectronic  hybrid  parallel  multiprocessor  system 

A  optoelectronic  hybrid  parallel  multiprocessor  system  is  building  up  in  our  laboratory;  in 
which  there  are  three  stages'  processor  arrays  (2DPA);  the  optical  interconnections  are 
employed  between  these  arrays,  as  shown  in  Fig.  lw.  Its  architecture  is  the  3D  pipeline. 

i„  (PA),  I/O  Array 


Fig.  1  The  scheme  of  O-Ehybrid  parallel  multiprocessor  system  4 

(2DPA)  ZD  Prawit  Eta— ■  Any ;  (PA)— Pmorawt  Fltmti  Array,  (SH)— Switch  Network;  (LA)— LED* 

Array;  (DA)— PIN* Array;  (FC)— fiber-optic CkMod,  (OEN) — Oi  bytorid iMercramectroa Network. 

Every  two  dimensional  O-E  hybrid  processor  array  (2DPA)  comprises  NxN  processing 
dements  (PEs).  The  interconnection  in  2DPA  is  electronic.  It  can  be  easily  reconfigured  into 
various  topologies 

2.  O-E  hybrid  Interconnection  network 

The  optical  interconnection  provides  the  communication  of  two  PEs  located  at  two 
2DPAs,  respectively,  in  the  third  dimension.  Tha  it  can  be  described  as  follows: 

(2DPA).  (PElj)0(2DPA)m(PEkl) 

where  n,m=l,2,3  and  ij,k,l=l~8.  This  is  an  O-E  hybrid  interconnection  network,  showing  in 
Kg.  2:  a)  the  photo  of  emitting  array,  fiber-optic  channels  and  receive  array,  b)  the  photo  of 
data  transmission. 


Fig.  2.  a)  The  photo  of  Emitting  Array,Fiber-optic  Channel  and  Receive  Array 
b)  The  photo  of  data  transmission 


3.  Implementation  of  topological  reconfiguration 

The  implementation  of  the  topological  reconfiguration  relays  on  an  electrically  addressing 
device,  that  is  an  Mx  M  (I/O)  crossbar  switch.  In  this  system  a  2Mx  2M(I/0)  crossbar  device  is 
needed  So  the  extended  complexer  of  the  crossbar  has  to  be  developed  with  3n  crossbars 
mentioned  above.  As  n=2,  6  crossbars  with  MxM  (I/O)  are  used  to  obtain  a  crossbar 
complexer  with  2Mx2M  (I/O).  It  can  be  easily  programmed  to  determine  which  input 
connect  to  which  output  by  instructions. 

In  2DPA,  the  PEs  with  four  pairs  of  input/output  links  are  interconnected  each  other  by 
using  the  two  extended  complexer  of  crossbars  with  2Mx  2M  (I/O).  By  programming  such  two 
crossbars,  various  topologies  such  as  mesh,  tree  and  hyper  cube,  etc.,  could  be  reconfigured. 
Fig  3  shows  a)  the  scheme  of  the  interconnection  network;  b)  the  photo  of  (2DPA) 


Fig.  3,  a)  The  scheme  of  the  Interconnection  network;  b)  The  photo  of  (2DPA) 

4.  Conclusion 

1)  A  O-E  hybrid  multiprocessor  system  is  accomplished. 

2)  A  O-E  fiber-optic  interconnection  network  is  used  in  this  system  successfully. 
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Abstract 

A  cellular  image  processing  technique  to  detect  edges  of 
gray-image  in  an  optical  shadow-casted  architecture  is 
described.  Coding  of  image  is  done  for  different  preselected 
threshold  levels  and  edges  are  obtained  in  a  single  operation. 


Summary 

For  achieving  parallel  gray-image  cellular  processing  [1] 
each  cell  ILj  of  the  image  is  assigned  to  a  coded  spatial 
pattern,  termed  as  virtual  processing  element  ytj.  The  coded 
spatial  patterns  are  generated  for  conversion  of  gray-image  I 
to  binary  virtual  image  V,  from  gray  pixels  of  the  original 
image,  the  size  of  which  is  dependent  on  the  number  of  gray 
levels  selected.  In  terms  of  image  logic  algebra  (-ILA)  (2]  the 
output  edge-detected  state  of  V  can  be  obtained  by  the 
following  operations: 

0|M;H|  -  V  |  M ;  H  |  .  ( V  |  H ;  R  |  ♦  dotV|H;R| 

*  *-i.oVT*TTT  +  *o.-4VT*T*T> 

where  0|M;N|  is  the  output  edge  detected  state  of  V.  M  and  N 
are  the  number  of  horizontal  and  vertical  cells  in  the  image 
respectively  and  theref oreivaries  from  1  to  M  and  j  varies 
from  1  to  H.  V|M;N|  and  Vpj ; H |  are  the  input  virtual  image  and 
its  inverted  forms  respectively.  0Ky  is  a  shift  operation  by  x 
cells  horizontally  and  y  cells  vertically  over  an  image.  '+' 
and  are  the  logical  OR  and  AND  operations  respectively. 

To  detect  edges  in  a  shadow-casted  optical  architecture 
the  input  virtual  image  V  is  inverted  and  optically  shifted 
through  one  Vy  cell  in  four  directions  and  logically  OR-ed. 
Subsequently  the  AND  operation  between  the  resultant  image  and 
the  input  image  Vtj  is  obtained  by  optical  parallel  array 
logic  (OPAL)  processing  [3],  After  proper  sampling  of  the 
processed  image,  the  edges  are  obtained  for  the  virtual 
pattern.  The  processed  virtual  output  image  is  then  decoded  to 
give  the  edge  detected  image  at  multiple  pre-selected  levels. 
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The  proposed  experimental  set-up  (fig.l)  consists  of  a 
pair  of  four  LED  sources  where  operational  kernels  are 
selected  by  proper  on-off  states  of  the  LEDs  for  desired  logic 
operation.  A  rule  is  proposed  for  multiple  level  coding  of 
gray-level  image  which  converts  I  to  V.  Results  obtained  for  a 
simple  gray  object  in  the  actual  set-up  proves  the  validity  of 
the  method.  A  computer  simulated  edge  detection  based  on  the 
proposed  technique  for  a  tomoscan  picture  (fig. 2a)  is 
presented  with  four  and  eight  intensity  levels  in  fig. 2b  and 
fig. 2c  respectively. 
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ABSTRACT 

New  abilities  of  iaage  processing  appearing  from 
introduction  of  controlled  local  connections  between  cells  in 
optoelectronic  trigger  medium  are  discussed. 

1.  An  optoelectronic  memory  usually  consists  of 
independent  electronic  cells  with  optical  inputs  and  outputs. 
We  consider  new  abilities  of  optoelectronic  memory  arising  from 
local  connections  between  cells.  With  specific  types  of 
connections  memory  medium  is  able  to  extract  (process)  the 
informative  features  of  recorded  images.  Resolution  and  memory 
capacity  are  determined  by  the  microelectronic  technology 
possibilities.  We  will  refer  to  a  memory  medium  with  connected 
cells  as  CMM. 

2.  Consider  the  binary 

optoelectronic  memory.  Every 

cell  comprises  trigger  T  with 

an  optical  input  and  output. 

The  projected  onto  medium 

image  is  percepted  by 

photocell  PC,  and  is  recorded 

by  flip-  over  from  state 

T  to  state  T  (T  •»  T  ).  The 
1  2  1  2 

circuits  (photocells)  for 

reading  (CR)  and  destroying 

(CD)  are  switched  off  during 

recording . 

Uniform  illumination  of 

all  CR  turns  on  light 

emitters  (LE);  it  restores 

recorded  images  (when  the 

photocells  CR  and  CD  are 

turned  off) . 

The  picture  is 

destroyed  by  flip-  over 

T  *  T  by  uniform 

2  X 

illumination  of  all  CD. 

Every  cell  is  connected 
with  its  nearest  eight 
neighbors.  The  connections 
are  of  inhibitory  type;  they 
block  up  connected  cells, 
i.e.  prevents  T  *  T2  flip- 

over.  A  special  threshold 
element  (TE)  (coupled  with  CD 
to  the  arm  of  trigger  T  )  combines  all  connections  from 
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neighboring  cells.  The  threshold  control  unit  (TC)  sets  the 
nuaber  of  connections  (N)  necessary  for  a  signal  to  appear  at 
the  TE  output.  This  signal  switches  the  aeaory  cell  off 
(T^  *  T  )  in  the  same  way  as  CD  signal  does.  The  system  also 

includes  a  connection  distributor  (D)  and  light  emitter  control 
unit  (EC).  When  LE  is  turned  on,  after  the  delay  time  the  ES 
unit  switch  the  LE  off  for  the  whole  image  analysis  cycle. 

The  main  feature  of  CMM  with  the  inhibitory  connections  is 
that  the  aeaory  state  of  the  chosen  cell  depends  only  on  the 
number  of  illuminated  neighbor  cells  which  prevent  to  T^ 

flip-  over  by  their  connections.  Of  course,  there  is  also  a 
back  inhibitory  effect  of  the  chosen  cell  on  its  neighbors. 

3.  We  discuss  the  possibilities  of  CMM  for  image 
processing.  We  assume  that  the  minimal  size  of  a  picture 
element  is  of  the  same  order  as  of  a  cell  size  and  the  distance 
between  cells  is  substantially  smaller  than  the  linear  cell 
size . 

When  N  =  1,  the  memory  records  only  isolated  elements  of  a 
size  of  the  order  of  a  cell,  with  at  least  a  one  cell  spacing. 
It  is  not  possible  to  record  another  picture  elements,  because 
even  one  inhibitory  connection  prevents  this.  The  addition  of  a 
new  neighboring  illuminated  element  leads  to  local  information 
destruction. 

When  N  =  2  the  memory  extracts  both  1-  cell  and  arbitrary 
2-  cell  spots.  If  N  =  3  it  is  also  possible  to  record  3-  cell 
spot s  and  arbitrary  lines  and  contours  of  a  one  cell  thickness; 
then  the  neighboring  elements  are  connected  only  by  two 
inhibitory  connections.  With  the  increasing  of  N  the  minimal 
size  of  recorded  spots  is  increased,  and  cells  in  off  state  can 
appear  inside  illuminated  spots. 

Large-  pattern  contours  can  be  extracted  when  N  >  5:  inner 
cells  are  inhibited,  whereas  contour  cells  (which  have  no 
illuminated  neighbors  outside  the  pattern)  are  not  inhibited. 

Analysis  of  pictures,  maps  etc.  is  best  carried  out  by 
"step  by  step"  cycle  increasing  N  from  1  to  8  with  exclusion  of 
fragments  extracted  at  the  (N  -  l)th  step.  In  fact  CMM  can  be 
used  for  decomposition  of  the  initial  picture  into  fragments: 
small  spots,  contours  of  large  spots,  lines,  lines  crossings, 
angles,  etc. 

4.  An  important  possible  application  is  extraction  of 
moving  fragments.  The  initial  picture  is  projected  onto  the  CMM 
and  then  all  corresponding  LE  are  turned  off  by  unit  ES.  Then 
the  initial  picture  with  shifted  elements  is  projected  again. 
Obviously,  only  emitters  of  the  cells  corresponding  to  recorded 
shifted  elements  are  turned  on.  The  subsequent  projecting 
operations  make  it  possible  to  determine  the  directions  and 
velocities  of  moving  elements.  This  method  is  also  suitable  for 
extraction  of  motionless  but  distinct  parts  of  patterns. 

5.  The  possible  applications  of  CMM  are  expanded  if  every 
cell  is  connected  not  only  to  the  nearest  cells  or  not  only  by 
inhibitory  connections.  Activating  excitatory  connections  (with 
inhibitory  connections  switched  off)  for  certain  time  it  is 
possible  to  enlarge  the  size  of  extracted  spots  and  the  width 
of  lines,  and  to  spline  indented  edges. 
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Abstract 

A  system  of  oscillators  with  optical  inputs  and  outputs 
transforms  a  projected  iaage  into  spatial  distribution  of 
oscillator  frequencies.  Resonant  excitation  of  different 
groups  of  oscillators  is  discussed  as  a  new  method  of  iaage 
processing . 

1.  The  main  idea  of  the  discussed  approach  is  the 
following  [1].  Suppose  there  is  a  matrix  of  identical 
quasiharaonic  noninteracting  oscillators  with  optical  inputs 
and  outputs.  The  image  I(X,Y)  is  projected  onto  the  matrix  and 
it  leads  to  the  change  of  oscillators'  frequencies  : 

o  >*  w  =  »  ♦  aI(X,Y) 

0  loc  0 

Hence  the  image  is  encoded  into  the  spatial  distribution  of 
frequencies. 

Then  the  homogeneous  pumping  is  projected  onto  the  medium 
through  a  wide  aperture  modulator  with  a  harmonic  regime  of 
modulation  (frequency  Q).  The  pumping  excites  only  such  parts 
of  medium  which  correspond  to  the  condition:  »  (X,Y)  %  Q. 

The  parametric  resonance  can  be  used  by  the  same  way. 
Therefore  varying  Q  we  can  provide  the  parallel  extraction  of 
regions  of  equal  intensity.  If  oscillators’  inputs  are 
sensitive  with  respect  to  light  wavelength  it  is  possible  to 
extract  regions  of  certain  color. 

Example  of  a  problem  which  can  be  solved  by  the  proposed 
method.  A  sheet  of  paper  contains  a  set  of  patterns  drawn  with 
different  intensities  (or  colors).  Projecting  the  sheet  onto 
the  medium  and  using  a  set  of  appropriate  Q  it  is  possible  to 
extract  every  pattern. 
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2.  We  have  studyed  different  Modifications  whereas  thr 

additional  elements  are  used:  passage  of  analyzing  pictures 

through  a  Modulator;  periodic  shift  of  picture  along  the 

Mediua;  usage  of  background  light  puap  distribution  in  the 

fora:  I#(X,Y)  =  Ifl  +  F(X,Y),  where,  for  example,  F(X,Y)  = 

F  (X2  +  Y2  ) .  Such  coaplications  allows  us  to  realize  the 
0 

following  procedures:  extraction  of  certain  parts  of  picture; 
reaoving  small  scale  distortions  from  patterns;  extraction  of 
contours,  points  of  extrema;  correlative  comparison  of  images; 
determination  of  common  and  distinct  parts  of  images,  moving 
eleaents,  etc. 

3.  Media  of  oscillating  elements  can  be  realized  by  the 
several  ways . 

a)  Matrices  of  microcavity  lasers. 

There  are  structures  in  which  every  microcavity  laser  is 
integrated  with  a  photocell  for  noncoherent  light  [2]. 
Informative  light  flow  is  percepted  by  the  photocells  and 
changes  the  frequency  w  of  laser  electron-  photon  resonance 
(»  -  10  s'  ) .  Existing  electrooptical  modulators  can  excite 
this  resonance  by  modulation  of  pumping  light.  So  we  only  have 
to  find  the  fact  of  lasers’  modulation. 

Matrices  of  microcavity  lasers  without  photocells  can 
also  be  used  when  the  coherent  informative  light  flow  is  used. 

b)  Matrices  of  passive  optical  bistable  elements  in 
regime  of  optical  oscillations  [3]  can  be  used  by  the  same 
way. 

c)  Analog  microelectronic  matrices  containing  oscillating 
circuits  with  optical  inputs  and  outputs  are  convenient  too. 
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Abstract .  An  approach  of  “transverse  interconnection  optical  processor 
architecture"  is  proposed  which  allows  flexible  optically  programmable 
data  exchange  between  individual  beam  channels  without  changing  their 
relative  dispositions. 

Using  the  light  as  information  carrier  implies  usually  that  the 
direction,  in  which  data  are  transported  in  free  space  or  in  waveguide, 
coincides  with  the  light  beam  direction.  However,  there  are  a  number  of 
nonlinear  optical  phenomena  such  as  transverse  effects  in  optical 
bistability  that  allow  to  direct  the  information  flow  perpendicularly  to 
the  light  beam.  When  studying  such  nonlinear  transverse  phenomena  it  has 
been  found  that  there  are  different  steady  profiles  of  the  output 
beam/image  intensity  for  the  same  distribution  of  input  light  interacting 
with  a  bistable  interference  layer.  The  selection  of  a  desired  profile  as 
well  as  transitions  between  them  can  be  easily  controlled  optically.  It 
gives  a  basis,  for  Instance,  for  all-optical  implementation  of  2D-data 
shift.  In  this  method  all  the  stages  of  the  information  transfer  are 
performed  in  the  plane  of  matrix  of  optical  elements.  It  provides  an 
opportunity  for  organizing  the  interconnections  and  information  exchange 
between  neighbour  logic  elements/pixels  within  2D-array  leading  to  new 
optical  computer  architectures. 

In  this  paper  the  method  of  "transverse  lock-and-clock"  processing 
[Ref.]  is  extended  to  a  "planar  -  free  space"  optical  interconnections  and 
circuits.  The  basic  idea  of  the  "planar  -  free  space"  architecture  is  to 
combine  the  parallel  data  transfer  between  sequentially  located  (within 
loop  parallel  processor)  matrices  of  switching/bistable  elements  and 
planar  transverse  interconnections  and  processing  within  the  plane  of  each 
matrix. 

In  such  an  approach  an  ensemble  of  light  beams  (a  digital  image) 
preserves  the  topology  of  their  relative  disposition  (with  no  shuffle, 
etc. )  when  propagating  in  free  space  between  two  sequentially  located 
nonlinear  matrices,  in  contrast  to  other  flexible  interconnection 
architectures.  The  flexibility  of  the  information  connections  between 
different  parts  of  the  image  is  reached  due  to  the  optically  programmable 
transverse  transport  and  redistribution  of  the  different  logical  states  of 
transmission/reflection  within  the  matrices  of  nonlinear  elements.  As  a 
result,  the  data  in  different  individual  channels  are  exchanged  without 
changing  the  spatial  positions  of  the  beams  that  carry  the  data  along  the 
processor  loop. 

The  examples  of  design  and  experimental  modeling  of  all-optical 
serial-parallel  and  parallel -serial  data  convertors,  stack  and  associative 
memory,  multiplexor-demultiplexor,  non-blocking  crossbar,  planar  loop 
circuits  and  regular  networks  are  given  (Fig. )  using  2D  bistable  thin-fllm 
Fabry-Perot  interferometer  with  optical  aperture  of  40mm. 
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serial-parallel  parallel-serial 


data  conversion 


”last-in-first-out”  or  ”  first-in-first-out”  multiplexor-demultiplexor, 

non-blocking  crossbar 


planar  optical  circuits  regular  networks 


Fig.  Optical  "planar  -  free  space"  logic  circuits 
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Abstract  t  Operation  of  shift  register  based  on  propagation  of 
switching  waves  in  distributed  nonlinear  aedia  is  studied  by  numerical 
simulations.  Comparison  with  experimental  results  is  also  presented. 

The  concept  of  lock-and-clock  architecture  for  optical  data 
processing  has  been  recently  modified  to  take  into  account  two  transverse 
degrees  of  freedom  and  all-optical  shift  register  based  on  this  concept 
has  been  demonstrated  experimentally  [Ref].  Information  shift  in  the 
register  takes  place  in  the  plane  of  matrix  of  optical  bistable  elements. 
For  further  development  of  this  concept  an  adequate  theoretical 
description  would  be  useful. 

In  present  contribution  we  describe  theoretical  model  of  a  planar 
all-optical  chip  consisting  of  an  array  of  bistable  elements  that  could  be 
coupled  using  transverse  Interconnections.  Analysis  of  the  model  is  based 
on  solving  coupled  equations  for  light  field  in  nonlinear  interference 
layer  and  equations  that  describe  phenomenologically  diffusion  of  medium’s 
nonlinear  parameters.  Light  and  heat  energy  balance  in  the  system 
"bistable  layer  -  substrate"  is  taken  into  account.  The  approach  can  deal 
with  various  types  of  nonlinearity,  such  as  band-filling  or  free  carrier 
generation.  In  particular,  we  use  heat  transfer  equations  for  substrate 
and  layer,  that  is  necessary  when  behavior  of  TFI  with  thermal 
nonlinearity  is  simulated. 

Results  are  presented  for  switching  dynamics  of  bistable  elements  at 
a  transition  from  one  logical  state  to  the  other.  The  shift  of  binary  data 
in  the  chip’s  plane  by  means  of  optically  controlled  switching  waves  is 
also  modeled  numerically.  Temporal,  spatial  and  power  parameters  have  been 
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Investigated  that  allows  to  optimize  2D  lock-and-clock  data  processing 
using  bistable  matrix.  The  above  parameters  define  rates  of  data  transfer 
and  processing,  spatial  resolution  and  information  capacity  of  matrix 
devices,  error  rate,  life  time  etc.,  and  are  important  for  architecture 
development.  Most  of  the  obtained  results  agree  well  with  the  previous 
experimental  data.  The  rate  of  data  transfer  in  intra-chip 
interconnections  and  in  shift  registers  is  estimated. 


3000 1 


Ovi.Inteis. 

W/cm* 


3000 


(a) 


3000 

3000 


Omtlateu. 

W/cm1 


(b) 


0.006  0.01  0.016 
space,  cm 

By  way  of  illustration  profiles  of  transmitted  Intensity  for  two 
bistable  pixels  with  no  influence  on  each  other  (a)  and  with  distinct 
degree  of  coupling  (b)  are  presented.  It  can  be  seen,  that  in  second  case, 
when  pixels  are  placed  closer,  conditions  for  transverse  transfer  of 
information  signal  are  net. 
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ABSTRACT 

Principles  for  optical  digital  data  processing  based  on  photon  echo  phenomenon  in 
a  resonant  media  are  considered.  The  various  schemes  of  optical  processors  realization 
with  the  pixel  structure  and  of  holographic  type  based  on  using  of  the  digital 
multiplication  by  analog  convolution  (DMAC)  algorithm  are  suggested.  Manners  of 
optical  data  flow  switching  of  informational  channels  on  the  basis  of  photon  echo 
phenomenon  are  discussed. 


SUMMARY 

The  Photon  Echo  (PE)  phenomenon  at  present  is  not  only  the  method  of  kinetic 
relaxation  processes  studing  in  a  resonant  media,  but  also  is  the  mean  of 
multifunctional  processing  of  optical  images,  formed  by  the  excitation  pulses  at 
different  time  moments.  Primary  proposals  of  PE  using  concerned  in  the  main  of 
analog  processing  methods  [1-3].  These  methods  are  characterized  by  very  high-speed 
processing  and  can  be  used  in  the  problem  of  pattern  recognition  [4],  However  there  is 
interest  in  the  development  of  optical  information  processing  methods  based  on  the  PE 
phenomenon  for  the  realization  of  digital  data  processing  and  of  optical  digital 
processors  design. 

Here  we  analize  the  optical  processing  for  realization  of  vector  and  vector-matrix 
algebra  operation  as  a  intrinsic  (scalar)  and  external  vector  product  calculations,  vector- 
matrix  and  matrix-matrix  multiplication. 

As  a  main  principle  of  optical  digital  information  processing  the  digital 
multiplication  by  analog  convolution  (DMAC)  algorithm  is  usually  used  [5,6].  We 
consider  two  types  of  optical  schemes  with  using  of  this  algorithm  in  time-domain  and 
space-domain  fields. 

In  the  first  case  the  binary  representation  of  the  multipled  numbers  is 
accomplished  in  the  form  of  temporal  sequential  code  and  the  processing  takes  place  in 
the  independent  space  structural  regions  (pixels).  In  each  pixel  of  resonant  medium  the 
convolution  or  correlation  functions  corresponding  of  two  pulses  in  time  are  calculated. 
On  the  whole,  the  spatial-temporal  light  modulator  (SLM)  has  to  form  different  images, 
the  number  of  which  is  equal  to  the  significant  digit  of  using  representation  for  under 
the  time  action  either  of  exciting  light  pulses.  The  adding  operation  of  the  different 
components  is  accomplished  by  cylinder  lens. 
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In  the  second  case  (so-called  the  scheme  of  holographic  type)  all  numbers  are 
introdiced  in  parallel  code  and  DMAC  algorithm  is  realized  in  space.  A  compound 
optical  scheme  accomplishes  one-dimensional  Fourier  transformation  on  one  coordinate 
and  other  coordinate  forms  optical  image  and  compensates  the  phase  distortion.  In  this 
case  the  resonant  medium  is  a  spectral  plane  of  multichannel  dynamic  hologram  with 
the pne-dimensional  filtering  of  spatial  frequencies.  As  a  result,  the  data  of  products  of 
different  vector  components  AjBj  in  the  mixed  binary  representation  is  formed  in 
output  plane.  The  cylinder  lens  executes  the  addition  operation  and  the  signal  that  is 
proportional  to  the  scalar  multiplication  of  two  vectors  A  and  B  is  arrised  on  the  linear 
array  of  photodetectors. 

By  the  particular  case  of  digital  data  processing  the  data  flow  switching  may  act 
as  a  control  data  transmission,  that  is  information  channel  switching. 

Data  flow  switching  can  be  easily  implemented  on  the  principle  of  vector-matrix 
multiplication.  In  the  first  scheme,  called  by  the  scheme  with  the  pixel  spatial  structure, 
the  setting  of  vector  components  is  initiated  by  the  specification  of  images  with 
identical  rows  and  column  numbers  being  equal  vector  dimension,  and  addition 
operation  is  accomplished  by  the  cylinder  lens. 

In  the  scheme  of  holographic  type  each  image,  formed  by  the  spatial-temporal 
light  modulator  represents  the  vector  with  the  component  number  on  one  coordinate 
equal  to  dimension  of  defined  vector  and  with  number  of  binary  digit  on  other 
coordinate.  Thus  so  far  as  matrix  A  defines  under  the  time  action  of  one  of  light  pulses 
in  a  view  of  M  images  (that  corresponds  of  definition  of  M  vectors  or  data  matrix),  that 
is  in  a  view  of  different  vector  sequence  in  time  than  the  channel  numeration  on  input 
must  correspond  to  definition  of  number  sequence  in  time. 

Thus,  the  digital  data  processing  principle  is  based  on  filtering  in  a  resonant 
medium  usually  time  frequencies  in  processor  with  the  pixel  structure  and  in  processor 
of  holographic  type  -  on  multichannel  one-dimensional  filtering  of  spatial  frequencies. 
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A  general  neural  unit  with  a  complex -valued  matrix  depicting 
integrated  synaptic  connection  and  lateral  interaction  is  proposed. 
Various  connections  can  be  self -programmed  by  a  complex  routing 
code  pair  contained  in  the  input.  Based  on  the  mixed  negabinary 
number  system,  an  optical  and  parallel  neural  architecture  of 
incoherent  optical  correlation  and  spatial  coding  is  developed. 

Various  artificial  neural  net  models  were  suggested  to  simulate 
the  biological  behaviors [1] ,  in  which  neurons  were  usually 
considered  to  have  a  set  of  two  real  states.  In  this  paper,  we 
suggest  a  new  concept  that  neurons  have  complex- valued  states, 
which  stimulate  a  combination  of  synaptic  connection  and  lateral 
interaction.  On  this  basis,  a  general  neural  unit  with  only  a 
complex -valued  matrix  is  proposed  to  organize  the  integrated 
synaptic  connection  and  lateral  interaction.  Thus,  vai ious 
connection  functions  can  be  self -programmed  by  the  routing  code 
contained  in  the  input  stimuli.  Based  on  the  mixed  negabinary 
number  system,  an  inner-product  algorithm  for  complex  matrix-vector 
multiplication  necessary  for  the  neural  unit  is  developed.  The 
features  are:  no  carriers,  no  signs  and  simple  pre-processing  and 
post -processing.  An  optical  architecture  of  incoherent  correlation 
with  spatial  coding  of  data  are  suggested.  The  required  complex 
matrix-vector  multiplication  can  be  realized  optically  in  parallel 
in  a  single  optical  channel  with  a  high  accuracy. 

The  model  of  complex-valued  general  neural  unit  is  shown  in 
Fig.l.  The  states  of  the  input  stimuli  are  denoted  by  a  complex¬ 
valued  column  vector,  [Vj  (K,+jKj)  ]  where  [v^  is  the  bipolar  input 
stimuli  and  K^KgCfl,  0,  -1}  assemble  the  routing  code  pair.  The 
complex- valued  interconnection  matrix  is  divided  into  the  real  part 
and  the  imaginary  part  as  [TRij]  +  j  [T1^]  .  A  matrix- vector 
multiplication  will  result  in  a  real  part  and  an  imaginary  part  as 

[  v/]  [  Tij]  [Vj]  -K2  [  Tfj]  [Vj]  , 

[  v/]  [  T?j]  [Vj]  +K2  [  T*j ]  t  Vj]  . 


Thus,  the  access  of  different  routing  codes  will  yield  various 
connection  functions. 

We  can  now  establish  the  rule  to  learn  the  complex-valued 
matrix  from  the  matrix  for  synaptic  connection,  [T,yi:i]  (NxM,  MaN)  , 
and  the  matrix  for  lateral  interaction,  [T1*^]  (NxN) : 

l  Tfj ]  =  [  Tij  ]  ,  (2) 

C  t/j]  =([t//]  -  [1])  [Tg]  . 


The  negabinary  number  system  has  the  radix  of  -2  [2]  .  A  positive 
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or  negative  real  number  can  thus  be  represented  in  a  form  of  the 
mixed  negabinary  number: 


a  * 


n*  0 


*„<■ 


•2 )  ° 


(aniO) 


(3) 


For  the  neural  complex  matrix-vector  multiplication,  the  nth  real 
digit  at  the  ith  row  of  the  resulting  column  vector  can  be  reached 
by  the  inner-product  algorithm: 

v'*  (i)  -T,  KxT*{i,  j)  -vn(j)  -K2TxU,  j)  -vn(j)  , 

J  (4) 

substraction  conversion :  -a*b  =  —  a-b„  .. 

n  2  n  1 


It  should  be  noted  that  a  substraction  can  be  converted  to  an 
addition  operation  by  a  shift  operation. 

To  implement  the  complex  matrix-vector  multiplication  by 
optics,  an  incoherent  optical  correlation  and  spatial  coding 
architecture  is  suggested  as  shown  in  Fig. 2.  The  coding  format  of 
the  vector  and  the  matrix  are  indicated.  The  image  of  the  light 
array  represents  the  output  digits  in  the  mixed  negabinary  system. 

This  work  was  supported  by  the  National  Natural  Sciences 
Foundation  of  China. 
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Fig.l.  Caaqplex  valued  general  neural  unit. 


Fig. 2.  Optical  matrix-vector  proceaaor  using  negabinary  numbers. 


MP47/121 


Polarisation-optical  atackad  integrated  nodule 
of  cellular  logic  two-layer  image  processor 

Liren  Liu,  Haifeng  Peng,  and  Yaozu  Yin 

Shanghai  Institute  of  Optics  and  Fine  Mechanics 
Academia  Sinica,  P.O.Box  800-211,  Shanghai  201800,  P.R.  China 

A  3-0  polarization-optical  stacked  integrated  module  of  cellular 
logic  image  processor  is  developed,  which  is  the  optical  cascade 
of  a  parallel  binary  logic  gate  and  a  morphological  dilation 
processor.  The  system  is  very  compact  and  effective. 

Cellular  logic  image  processors1  is  an  important  architecture 
for  optical  computing.  The  recently  reported  optical  cellular 
image  processors  include:  The  optical  implementation  of  the 
cellular  logic  image  processor  (CLIP)  architecture1  by  using  S- 
SEED  devices2  and  BEAT  devices3;  A  digital  optical  cellular  image 
processor  based  on  binary  image  algebra  with  the  use  of  light 
valve4;  An  optical  cellular  two-layer  logic  image  processor  based 
on  one-operation  image  algebra  using  dual-rail  spatial  coding 
and  electronic  thresholding5.  In  these  suggestions,  however,  the 
optical  interconnections  were  realized  in  the  free  space.  A 
quite  distant  light  path  should  be  utilized,  so  that  the  optical 
system  is  large.  In  this  paper,  we  develop  a  new  optical  module 
scheme  using  3-D  stacked  integration  of  polarization  elements. 
It  can  been  seen  that  the  suggested  optical  cellular  image 
processor  module  is  compact  in  configuration,  effective  in 
performance,  and  insensitive  to  environment. 

Fig.l  shows  schematically  the  cellular  architecture,  which 
consists  mainly  of  a  CPU  of  a  parallel  binary  logic  processor 
followed  by  a  morphological  binary  dilation  processor  and  a 
feedbacking  manager.  The  threshold  device  and  sum  gate  are  used 
to  deal  with  gray-tone  images.  It  was  proven  that  all  the 
possible  binary  and  gray-tone  image  processing  functions  can  be 
executed  by  programming  the  logic  operations  in  the  iterations5. 

The  packaged  design  of  the  optical  CPU  module  is  shown  in 
Fig. 2.  The  calcite  plates  (CP)  are  used  to  split  an  incident 
beam  into  two  spatially  separated  beams  with  polarizations 
orthogonal  to  each  other,  the  quartz  plates  (PR)  to  rotate  the 
polarization  of  beam  by  45°,  the  input  masks  or  the  PLZT  SLMs 
(IM)  to  display  the  two  input  images  in  a  dual-rail  spatial 
coding  manner,  the  ML  mask  to  control  the  logic  operations,  and 
the  MS  masks  to  stop  the  undesired  beams. 

The  optical  elements  from  the  IM,  to  the  beamsplitter  (BS) 
constructs  the  parallel  logic  processor,  and  the  following 
elements  constitute  the  morphological  processor.  On  the  lower 
part  of  the  figure,  the  path  routing  of  beams  in  a  coding  area 
of  cell  is  illustrated.  It  is  seen  that  the  output  pattern  from 
the  logic  processor  at  the  BS  is  a  combination  of  four 
equivalent  maxterm  of  two  input  images.  The  selection  of  the 
ON/Off  states  (k,-k4)  of  the  four  apertures  in  each  coding  area 
of  the  ML  mask  can  result  in  all  the  sixteen  binary  logic 
operations.  In  the  morphological  processor,  a  beam  is  splitted 
into  four  spots  in  its  nearest  neighbors.  The  structuring 
element  S  is  a  set  of  { (1,0) , (-1,0) , (0,1) , (0,-1) }.  Using  a 
photodetector  array  with  electronic  max  thresholding,  the 
dilation  can  be  obtained. 


i 
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The  polarization-optical  stacked  module  has  been  developed: 
the  dimension  20mmX20mmX140mm  and  the  possible  image  pixels 
1  X10 .  Some  primary  experimental  results  were  obtained. 

This  work  was  supported  by  the  National  Natural  Sciences 
Foundation  of  China  and  the  Bureau  of  High  Technique  of  China. 
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Fig.l.  Cellular  logic  image  processor  architecture. 


Fig. 2.  Polarization-optical  stacked  integrated  cellular  CPU. 
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Abstract:  A  direct  2's  complement  array  multiplication 

architecture  is  proposed,  and  for  higher  radix  system,  it  also 
applies.  Based  on  complex  partition,  a  two-stage  array  is 
constructed  for  complex  operation.  In  correspondence,  an  optical 
configuration  is  put  forward. 

Twos  complement  encoding  has  been  widely  used  in  optical 
matrix  operation,  but  the  conventional  algorithm  has  several 
fatal  defects113,  such  as  the  limitation  on  the  bit  number  of  the 
operands  and  the  severe  waste  of  space  bandwidth  product (SBWP)  . 
The  modified  version111  needs  much  sign  decision  and 
postprocessing,  and  the  convolutional  result  is  still  unable  to 
be  directly  weighted  and  summed.  In  the  following,  another 
modified  direct  2's  complement  array  multiplication  algorithm  is 
advanced  and  applied  to  complex  operation. 

In  2's  complement,  assume  either  of  the  two  N-bit  operands 
a  and  b  takes  the  form  {xN.1xM.2--x.Xp} .  With  the  help  of  mixed  2's 
complement  number  system,  the  bitwise  addition  is  conducted 


without  carries  and  subtraction  is  changed  to  addition  according 
to  the  additive  inverse  representation  of  the  subtrahend.  For 
multiplication,  the  below  equation  can  be  derived123: 

+  (l ♦aJ(L15JM>  Tajbt'21** 

_  **  (l) 

+  E  (alf.1Ei*bihlai)  ■21*Ihl+  (a^+b^)  2lf~1. 


The  above  product  can  be  formed  with  a  parallel  array 
architecture  as  shown  in  Fig.l.  Compared  with  the  original 
algorithm,  the  direct  array  multiplication  eliminates  the 
limitation  on  the  bit  number  of  the  operands,  use  the  SBWP  more 
efficiently,  and  the  result  is  represented  in  mixed  2's 
complement  straightly.  Furthermore,  the  algorithm  can  be 
extended  to  higher  radix  system. 

For  complex  operation,  the  preceding  array  architecture 
applies.  Let  a=a1+ja2,  b^+jb,,  in  which  a1f  a,,  b1f  and  b2  are 
real  numbers  and  are  encoded  by  N  bits,  then  tneir  product 

P-P^jPz#  where  p1=a1b,+  (-a2b2) ,  p2=a1b?+a2b1 . 

Therefore,  a  two-stage  array  architecture  consisting  of  four 
subarrays  is  established  in  Fig. 2. 

The  two-stage  array  architecture  has  been  verified  by 
experiment.  The  optical  construction  is  shown  in  Fig. 3,  which 
comprises  three  parts.  The  first  imaging  part  forms  the  product 
array;  The  second  multiple-imaging  system3*3  carries  out  addition 
of  the  relevant  subarrays;  The  incoherent  correlator t53 
integrates  the  diagonal  elements  to  get  both  real  and  imaginary 
parts  of  the  result.  The  system  is  interfaced  with  a  PC  computer 
for  pre-  and  post-  processing. 

The  work  is  supported  by  the  National  Natural  Science 
Foundation  of  China. 
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Fig.l  Parallel  array  Fig. 2  An  array  complex  unit 
multiplication  scheme. 
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ABSTRACT 

A  new  technique  for  high-speed  recoded  trinary  signed-digit  arithmetic  using  optical  symbolic 
substitution  is  presented.  This  technique  performs  multi-bit  carry-free  addition  and  barrow-free 
subtraction  in  constant  time  using  the  minimum  number  of  min  terms. 


SUMMARY 

The  problems  of  achieving  massively  parallel  optical  computing  have  been  investigated  by  many 
authors  using  the  residue  number  system,  signed-digit  number  system,  and  recoded  signed-digit 
number  systems  (1-3].  Using  residue  number  system,  one  can  perform  parallel  arithmetic  in  constant 
time  using  symbolic  substitution  (4],  but  the  size  of  the  truth  table  required  increases  rapidly  with  the 
increase  of  the  operand  length.  The  redundant  signed-digit  number  system  [5]  allows  parallel 
arithmetic  with  fewer  carry  propagation  steps.  For  example,  the  modified  signed-digit  (MSD)  number 
system  confines  the  carry  propagation  to  two  adjacent  digit  positions  (5).  This  property  of  the  MSD 
number  system  permits  carry-free  addition  and  borrow-free  subtraction  of  two  arbitrary  length 
numbers  in  parallel  which  makes  it  suitable  for  optical  architectures.  However,  the  information 
storage  density  of  the  MSD  number  system  is  limited  which  may  result  in  complex  circuits  and  may 
increase  the  severity  of  the  interconnection  bottleneck  [6].  Higher  radix  signed-digit  number  systems 
allow  higher  information  storage  density,  less  complexity,  fewer  system  components,  and  fewer 
cascaded  gates  and  operations.  Among  the  higher  radix  number  systems,  the  trinary  signed-digit 
number  system  appears  to  be  the  most  promising  in  terms  of  storage  density  and  processing  elements 
16]. 

Carry-free  addition  using  n-step  [7],  three-step  [8],  two-step  [1],  and  one-step  [9]  optical  symbolic 
substitution  have  already  been  reported.  The  number  of  symbolic  substitution  rules  usually  decreases 
as  the  number  of  computational  steps  increases.  Recently,  a  two-step  symbolic  substitution  technique 
for  trinary  signed-digit  arithmetic  has  been  reported  [6].  This  technique  performs  carry-free  addition 
and  borrow-free  subtraction  by  checking  a  pair  of  reference  digits  from  the  next  lower  order  digit 
position  and  requires  58  four-variable  minterms  for  each  output  digit.  Later  on,  a  higher-order  trinary 
signed-digit  symbolic  substitution  technique  has  been  designed  [1]  which  enhances  the  computation 
speed  but  excessively  increases  the  number  of  minterms  per  output  digit. 

Parhami  [10]  recently  proposed  a  simpler  carry-free  addition  scheme  where  the  MSD  numbers  are 
recoded  before  performing  the  addition.  Awwal  [2]  implemented  the  recoded  MSD  arithmetic  technique 
using  an  opto-electronic  implementation  which  requires  sixteen  four-variable  minterms  per  output 
digit.  Thus,  in  any  symbolic  substitution  scheme,  the  most  important  objective  is  to  minimize  the 
number  of  minterms  (substitution  rules)  and  computational  steps  while  incorporating  the  optimum 
information  in  fewer  digits.  To  achieve  these  objectives  without  sacrificing  the  processing  speed  and 
parallelism  of  optics,  we  propose  herein  the  recoded  trinary  signed-digit  system  for  multi-digit 
carry-free  addition  and  borrow-free  subtraction. 
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The  proposed  trinary  signed-digit  technique  is  memory  efficient  since  it  reduces  the  substitution  rules 
by  more  then  50%  when  compared  to  the  most  recently  reported  trinary  signed-digit  arithmetic 
technique  [6].  Moreover,  only  three  two-variable  min  terms  are  required  for  the  addition  stage  of  the 
proposed  technique  while  six-variable  [1]  and  four-variable  (6]  min  terms  are  required  for  the  addition 
stage  of  the  previously  reported  schemes.  The  proposed  technique  can  be  implemented  mn«g  a  angle- 
stage  opto-electronic  implementation  where  the  recoding  operation  is  performed  electronically  and  the 
addition  is  performed  optically  using  a  content-addressable  memory  (CAM).  An  all-optical  CAM-based 
implementation  involves  two  processing  steps  since  both  recoding  and  addition  must  be  performed 
optically.  In  either  implementation  technique,  the  minimized  min  terms  are  stored  mring  a  holographic 
or  nonholographic  CAM  A  holographic  CAM  usually  suffers  from  its  inconvenient  recording  problems. 
On  the  other  hand,  a  nonholographic  CAM  provides  faster  processing  speed,  ease  of  alignment,  and 
programmability.  Since  a  few  minterma  is  required  in  the  proposed  technique,  therefore,  a 
nonholographic  CAM  may  be  used  for  storing  the  minimised  min  terms. 

In  this  paper,  we  presented  an  efficient  technique  for  parallel  trinary  signed-digit  arithmetic.  The 
proposed  technique  performs  parallel  carry-free  addition  and  borrow-free  subtraction  in  a 
time  in  two  steps  independent  of  the  operand  length.  This  technique  requires  the  minimum  number 
of  minterms  when  compared  to  the  previously  reported  trinary  signed-digit  arithmetic  fawhniq.n»«  [i, 
6].  Also,  in  this  technique,  more  information  can  be  incorporated  in  fewer  digits,  thereby  reducing  the 
system  size  and  complexity.  Finally,  a  CAM  baaed  optical  implementation  is  suggested  for  the 
proposed  scheme. 
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Abstract 

We  devise  a  methodology  for  optical  architectures.  We  present  the  general  de¬ 
scription  of  the  multi-port  optical  components.  Then  use  flow  graph  analysis  for 
these  components  to  analyze  optical  architectures  before  the  implementation  phase. 

It  illustrates  the  behavior  of  the  optical  data  signal  at  any  point  of  the  optical 
system. 

Summary 

Development  of  optical  components  have  witnessed  great  interest  in  the  last  few  decades 
[1].  These  components  were  successfully  used  in  optical  computing.  Experimental  results 
for  this  success  are  available  in  the  literature  [2]  and  [3].  As  far  as  we  know,  very  few 
researchers  have  discussed  modeling  of  these  optical  components  in  a  classical  manner  [4]. 
A  methodology  for  a  special  purpose  optical  architecture  has  been  developed  by  Murdocca 

[5]. 

Our  objective  is  to  develop  a  new  methodology  to  include  the  most  widely  used  op¬ 
tical  components  such  as  Fabry- Perot  interference  filter.  Then  we  extend  this  method 
to  complete  optical  architectures.  We  discuss  first  the  general  aspects  of  these  modeling 
techniques.  Then  we  apply  these  techniques  to  optical  architectures  that  widely  use  some 
of  these  components  [2]. 

In  practice  there  are  three  generic  classes  of  optical  network  architectures.  Those  that 
use  fiber-optic  interconnects  (fiber-optic  networks),  others  that  use  waveguides  (integrated- 
optic  networks),  and  third  that  use  free-space  interconnects  (free-space  optic  networks). We 
expect  that  this  methodology  will  be  suitable  for  all  these  categories. 

The  analysis  and  the  methodology  that  is  presented  follow  the  analysis  of  the  mi¬ 
crowave  networks,  the  similarities  stems  from  our  concern  with  the  travelling  waves,  in 
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which  the  so-called  scattering  parameters  are  useful.  These  parameters  constitute  the 
S-matrix  (scattering  matrix)  (4).  Similarly,  with  optical  components,  we  can  talk  about 
the  S-matrix.  Except  that  when  we  have  optical  components  we  should  introduce  two 
single  virtual  ports  instead  of  each  physical  port  .Therefore,  our  optical  scattering  matrix 
is  composed  of  elements  of  2x2matrices  rather  than  regular  numbers.  The  S-matrix  of 
some  common  optical  devices  such  as  the  one-port,  two-port  components  and  directional 
couplers  are  presented. 

Usually  virtual  ports  have  to  be  introduced  for  the  optical  components  in  addition  to 
the  existing  physical/real  ports.  This  is  necessary  in  order  to  account  for  the  reflections 
or  losses  of  these  optical  components.  Signal  flow  graphs  can  be  used  to  simplify  the 
results.They  are  also  used  to  represent  the  proper  equations  describing  the  components. 

We  have  developed  a  new  methodology  that  can  be  used  to  model  optical  architectures 
based  on  some  widely  used  optical  components  such  as  the  Fabry-Perot  interference  filter. 
Extension  of  this  methodology  to  large  optical  architectures  is  simple.  The  objective  is  to 
provide  a  tool  to  the  experimenter  to  deal  with  the  architecture  before  its  implementation. 
The  next  step  for  this  methodology  is  validation.  We  expect  to  validate  the  steps  of  this 
methodology  soon. 
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Abstract 

Optical  computing  is  an  emerging  field  that  makes  use  of  light  as  a  carrier  of 
information.  During  the  last  decade  numerous  architectures  have  been  proposed 
to  solve  problems  ranging  from  special  purpose  image  processing  to  general  pur¬ 
pose  optical  computers.  The  advantages  such  as  high  connectivity,  non-interference 
of  signals,  high  spatial  and  temporal  bandwidths,  low  signal  dispersion,  electro¬ 
magnetic  immunity,  massive  inherent  parallelism,  combined  with  analog  /  digital 
computational  capabilities  make  optical  computing  an  attractive  area. 

Despite  the  aforementioned  features,  the  discipline  of  optical  computing  remains 
an  ad  hoc  one.  One  of  the  primary  reasons  for  that  is  the  lack  of  significant  design 
tools  that  help  automate  the  design  process  which  is  ad  hoc  in  nature.  Tools  will 
also  promote  reusability  of  building  blocks  in  different  optical  systems. 

As  part  of  an  ongoing  project,  we  have  been  looking  into  the  various  aspects 
of  design  automation.  We  are  primarily  interested  in  developing  an  overall  design 
automation  system  for  optical  systems  to  assist  system  architects. 

In  this  paper,  a  design  for  a  Computer  Aided  Optical  Design  system  (CAOD) 
is  presented.  A  sketch  of  the  status  of  implementation  is  also  presented. 


1  Summary 

The  Computer  Aided  Optical  Design  CAOD  system: 

1.  is  a  workbench  of  software  tools  with  clearly  separated  functionalities  and  well 
designed  interfaces.  This  promotes  independent  evolution  of  these  tools 

2.  allows  optical  systems  designers  to  fully  exploit  and  use  the  special  characteristics 
of  optics 

3.  provides  a  language-centric  view  to  the  designer 

4.  provides  unified  semantics  across  different  interaction  paradigms 

5.  promotes  object-oriented  design  methodologies  for  the  development  of  optical  ar¬ 
chitectures 

6.  does  not  over  burden  physicists  with  excessive  programming  details 

7.  does  not  over  burden  system  architects  with  the  theoretical  foundations  of  optics 
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Figure  1.  shows  the  CAOD  system.  It  consists  of  a  number  of  software  tools  (repre¬ 
sented  by  boxed  items).  These  tools  interact  by  reading  from  and  writing  to  a  number 
of  files  (represented  by  elliptic  shaped  items).  An  architect  describes  the  optical  system 
under  study /development  using  either  PLOADS  (a  language  representation)  or  using  an 
interactive  3-D  graphics  editor.  The  translator  tool  takes  the  description  in  PLOADS  and 
produces  a  detailed  representation  of  the  architecture.  The  3D  graphics  editor  produces  a 
detailed  representation  as  well.  The  simulator  uses  the  detailed  representation  to  conduct 
simulation  and  produce  results.  The  design  rule  checker  verifies  user  defined  constraints 
and  reports  the  outcome  of  different  analyses.  The  optical  layout  tool  takes  the  specifica¬ 
tion  of  a  verified  architecture  and  translates  that  into  a  detailed  manufacturable  setup: 
its  importance  is  expected  to  grow  with  large  integrated  systems. 

The  component  database  contains  detailed  information  about  optical  components.  The 
component  library  is  a  collection  of  implementations  for  the  methods  that  model  the 
behavior  of  various  optical  components  types.  Mathematica  is  a  general  purpose  algebraic, 
numerical,  and  symbolic  computation  package.  Mathematica  is  included  in  the  CAOD 
system  to  simplify  the  implementation  of  complex  computations  that  arise  in  several  parts 
of  the  CAOD  system.  Whenever  any  sophisticated  computation  is  required,  services  from 
the  commercial  package  Mathematica  are  sought.  With  this  overall  view  of  the  CAOD 
system  in  picture,  subsequent  sections  discuss  the  details  of  the  different  CAOD  tools. 


Figure  1.  The  overall  CAOD  system 
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ABSTRACT 

A  new  method  able  to  evaluate  the  behaviour  of  an  optical  non-linear  device,  used  as  logic  gate, 
based  on  fractal  dimension  tools,  is  reported.  A  real  example  is  shown. 


SUMMARY 


It  is  already  well  know  that  using  an  optical  bistable  device  as  a  logic  gate,  we  have  four  possible 
configurations.  Each  one  of  them  give  us  a  different  logic  function. 

This  logic  function  will  be  obtained  from  two  binary  input  data.  A  bias  signal  will  allow  to  control  the 
decision  level  of  the  non-linear  characteristic*11. 

One  of  the  most  important  parameters  of  a  logic  gate  are  margin  tolerances  of  the  different  signals. 
They  let  the  logic  gate  to  operate  correctly  on  the  sense  of  the  expected  function.  These  parameters  are 
difficult  <o  define  for  an  optical  non-linear  device  due  to  the  device  dependence  on  wavelength,  temperature 
and  others.  We  propose  a  method  to  know  the  precision  level  of  data  and  bias  signals  on  an  specific  logic 
gate,  by  two  measurements  of  every  one  of  the  possible  outputs.  This  method  will  be  described  in  this  work. 

As  an  example  of  precision  measurement  by  fractal  dimension  we  are  going  to  analyze  one  of  the 
most  complicated  configurations  of  an  optical  bistable  device  as  a  logic  gate  (Fig.  1).  Its  difficult  comes  from 
the  output  characteristics.  It  corresponds  to  the  output  characteristic  of  a  SEED. 

The  method  is  based  in  the  generation  of  a  data  chart  that  indicates  the  various  logic  functions  that 
can  be  obtained.  The  y-axis  corresponds  with  the  bias  signal  values  and  the  x-axis  with  the  expected  value 
of  the  decision  level  D,  normalized  by: 


where  I  is  the  intensity  normalized  value  of  a  bit  "1"  at  the  input,  i  is  the  total  intensity  at  the  input  (sum  of 
the  two  data  signals  plus  the  bias  signal). 

The  total  chart  for  all  the  possible  outputs  has  been  presented  as  part  of  the  result  of  an  Optically 
Programmable  Processing  Element  and  it  has  been  reputed  previously  [2]-[3j.  We  just  present  here  an  small 
area  of  the  total  chart,  6&2.  This  figure  has  been  obtained  by  computer  simulation  of  the  ideal  characteristic 
shown  in  figure  l.c.  In  this  example  we  have  considered  that  the  distance  between  the  different  decision 
levels  are  the  same.  So  D,  =  d,  but  h  can  be  taken  any  other  value. 

Figure  2  shows  the  logical  function  obtained  with  a  precision  level  of  the  bias  signal  of  0.1  and  a 
decision  level  of  0X11.  On  figure  3,  the  precision  level  of  each  axis  has  been  increased  separately:  a)  0.01  - 
0.01;  b)  0.1  -  0.001.  Figure  4  shows  the  result  for  a  better  precision  on  both  axis:  bias  signal  -  0.01;  decision 
level  •  0.001.  These  figures  show  bow  the  image  is  repeated  as  the  precision  gets  higher.  It  corresponds  with 
the  type  of  fractales  that  can  be  found  in  the  nature. 

The  fractal  dimension  calculated  by  box-counting  method,  for  the  different  charts  shown  in  figures 
2-4,  can  be  summarized  on  the  table. 
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TABLE  L-  Parameter  values  for  fractal  dimension. 


S 


0.01/0.001 


0.01/0.01 


0.9145 


0.8817 


0.8451 


In  this  table  we  can  see  the  direct  relation  between  the  precision  level  of  the  data  on  each  axis  and 
the  value  of  the  fractal  dimension.  So  if  two  data  charts  of  a  logic  gate  function  are  obtained  by  direct 
measurements  of  input  and  output  signals,  the  fractal  dimension  will  give  the  precision  of  the  logic  gate.  And 
this  precision  would  be  better  as  nearest  the  value  is  to  one. 
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ABSTRACT 

Digital  chaotic  behaviour  in  an  Optically-Processing  Element  is  reported.  It  is  obtained  as  the  result 
of  processing  two  fixed  train  of  bits.  Period  doublings  in  a  Feigenbaum  scenario  have  been  obtained. 

SUMMARY 

The  employed  optically- programmable  digital  circuit  has  been  already  reported  in  (l)-(2)  as  a 
Programmable  Logic  Gate.  A  brief  description  on  its  method  of  operation,  as  well  as  the  way  it  has  been 
implemented,  can  be  fund  there. 

Our  circuit  is  composed  by  two  optical  devices,  P  and  Q,  (see  Figure)  with  a  non-linear  behaviour. 
The  output  of  each  one  of  them  corresponds  the  two  final  outputs,  O]  y  Oj,  of  the  cell.  The  possible  inputs 
to  the  circuit  are  four.  Two  of  them  are  for  the  input  data,  I,  e  I*  and  the  other  two,  g  and  h,  for  control 
signals.  The  corresponding  inputs  to  the  non-linear  devices  are  based  mi  these  signals  plus  some  other 
coming  from  inside  the  own  cell. 

The  practical  implementation  we  have  carried  out  of  the  processing  element  has  been  based  mi  an 
optotectronicsl  configuration.  Lines  in  Figure  represent  optical  multimode  fibers.  The  indicated  Mocks,  placed 
in  order  to  combine  the  corresponding  rignak,  are  conventional  optical  couplers.  In  this  way,  the  inputs 
arriving  to  the  individually  devices  are  multilevel  signals. 

Moreover,  for  the  results  we  are  going  to  report  here,  a  computer  simulation  has  been  carried  out. 

The  main  difference  of  our  present  circuit  with  respect  to  previous  similar  logical  circuits  is  the 
presence  of  additional  delays  and  feedbacks. 

The  first  difference  has  been  the  introduction  of  a  feedback  from  one  of  the  two  possible  outputs 
to  one  of  the  system  inputs.  Moreover,  according  to  previous  studies  in  this  field,  the  introduced  feedback 
should  have  some  delay.  An  internal  delay  has  to  be  taken  into  account  because  the  response  time  of  the 
optoelectronic  devices.  In  this  case,  an  oscillatory  behaviour  could  be  the  normal  output  of  the  system.  But, 
under  some  conditions,  this  is  not  always  true.  The  output,  as  we  win  shown,  is  not  periodic  for  some 
parameters  of  the  system.  Some  of  these  situations  will  be  shown  in  this  paper. 

In  order  to  chose  the  more  adequate  feedback,  two  are  the  possibilities.  Because  the  P-devke  output 
has  more  possible  different  functions  depending  on  hs  control  signal  than  the  Q-,  we  have  used  this  output 
for  feedback.  The  input  is  the  control  signal  g  of  device  P.  The  figure  shows  the  employed  circuit. 

If  we  consider  an  inner  delay  time,  owing  to  the  switching  time  of  internal  device,  plus  a  feedback 
delay  time,  we  find  different  results  depending  on  the  rate  between  both  delays.  In  all  cases,  we  find  an 
asymptotic  behaviour  of  the  circuit.  Some  of  the  obtained  results  have  beat  achieved  by  experimental 
methods  and  some  others  by  computer  simulation.  Optoelectronic  cells  simulate  P  and  Q  devices. 

The  first  analysis  that  we  have  performed  considered  null  delay  times.  This  situation  has  not  an 
algebraic  solution  and  no  data  were  obtained.  The  circumstances  are  strongly  different  if  we  introduce  finite 
delay  times,  namely,  internal  and  external  delays. 

According  to  previous  studies,  the  situation  with  more  probability  to  give  a  periodic  or  even  chaotic 
solution  is  when  internal  time  delay  is  shorter  than  the  external  one.  In  any  case,  input  has  been  a  regular 
train  of  pukes.  The  input  to  the  non-linear  device  is  a  multilevel  signal  corresponding  to  the  addition  of  the 
two  inputs.  The  period  of  this  signal  corresponds  to  a  time  of  14  milliseconds. 

If  the  ratio  between  internal  delay  time  and  external  delay  time  is  smaller  than  1,  we  obtain  a 
periodic  situation.  The  period  of  this  signal  is  strongly  dependent  on  the  ratio  value.  In  the  particular  case, 
where  external  time  delay  is  200  ms  and  internal  delay  are  2, 4  and  12  ms,  obtained  results  are  summarized 


Flgare.-  Optkat-Prograaabte  Logic  Circuit  with  Feedback. 


in  Tabic  I.  An  interesting  resub  is  the  duplicaticm  in  period  time  when  the  ratio  between  delays  get 
smaller.  It  goes  from  70  to  280.  Hence,  we  have  obtained  frequency  doubling.  And  this  is  one  of  the  best 
indications  of  the  route  to  chaos. 

TABU  I.-  Characteristics  of  thn  output  signals,  according  to  tha  daisy 
timas. 


T. 

Tt 

Ti/T. 

Period 

14 

200 

2 

0.01 

280 

14 

200 

4 

0.02 

140 

14 

200 

12 

0.06 

70 

Values  given  at  Table  I  do  not  correspond  to  the  transition  points  between  different  periods.  They 
are  in  a  range  where  period  remains  constant  If  we  calculate  the  equivalent  to  the  Feigenbaum  ratio  for  the 
indicated  values  we  obtain  4. 

But  if  higher  order  transition  points  are  taken  into  account,  a  number,  closer  to  4.6692,  has  been 
have  been  obtained. 

As  it  is  can  be  seen  in  Table  I,  as  internal  delay  time  goes  to  smaller  values,  the  period  of  the  output 
signal  get  higher  and,  eventually,  becomes  chaotic  This  situations  has  been  just  obtained  by  computer 
simulation  with  internal  delay  time  zero.  Experimentally,  we  have  not  been  able  to  obtain  k. 

The  above  facts  show  that  optically  processing  elements  arc  good  candidates  as  pulse  generators  with 
controlled  periods  tod,  if  die  computer  results  me  obtained  experimentally,  a  random  sequence  of  optical 
pulses  can  be  achieved. 

REFERENCES 


1. -  J.A.Martin-Pereda  and  A.Gonz41cz-Marcos,  SPIE,  VoL  2038.  pp.  67-77.  (1993) 

2. -  A.Gonz&iez-Marco6,  PhD  Thesis.  1993. 


MPS4/135 


Architectures  for  Photonic  Space-and  Wavelength-Switching  Using  Collinear 

guided-wave  Acoustooptics. 

Proklov  V.V.,  Bashlakov  A.U.  and  Biijukov  VA 

Institute  Radio  Engineering  and  Electronics  of  the  Russian  Academy  of  Sciences, 
Moscow,  103907,  Mochovaya  11,  GSP-3,  Russia 

telephone:(095)-526  91  92 
Fax:  (095)-203  84  14 
E-mail:  bys3249ire216.msk.SU 

Abstract 

This  work  presents  a  new  approach  for  algorithms  and  architectures  for 
photonic  switching  in  mixed  space-wavelength  domains  by  means  of  collinear 
guided-wave  acoustooptic  interaction.  We  investigate  a  few  most  advanced 
schemes  and  proposed  capabilities  for  IiNbC>3  guided-wave  implementations  in 
optical  computation  and  associative  memory  devices. 

Summary 

There  were  known  a  different  wavelength  -  division  -  multiplexing  and  space- 
division-multiplexing  systems  to  perform  the  optical  switching  with  increased 
transmission  capacity  and  system  flexibility  (see,  for  example  [1]).  In  each 
mentioned  case  it  has  bear  used  a  specific  property  of  required  devices  -  the 
wavelength  selectivity  or  the  spatial  resolution. 

In  this  paper,  we  investigate  an  improvement  of  the  guided-wave  photonic 
switching  devices  by  means  of  utilizing  of  the  combined  space-wavelength  divisions 
multiplexing.  The  principal  idea  is  based  on  earlier  demonstrated  switching 
capability  of  the  multifrequency  acoustooptic  (AO)  interaction  of  the  light  guided 
modes  with  collinear  propagated  surface  acoustic  waves  (SAW)  in  planar  Ti- 
dxSUsed  waveguide  in  YX-LiNb03  that  results  in  the  radiative  substrate  mode  with 
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90°  -rotation  of  light  plane  polarization  |2].  Fig.  1  shows  the  scheme  of  this  kind 
of  single  channel  planar  AO  cell  with  estimated  switching  capability  in  space- 
wavelength  divisions  about  ~  1011  switch. /sec. 

In  this  paper  we  consider  a  few  possible  architectures  and  algorithms  for  ID 
and  2D  photonic  switching  in  space-wavelength  domains  based  on  the  mentioned 
multifrequency  and  multichannel  planar  AO  cells.  It  gives  also  detailed  analysis 
experimentally  achieved  and  potential  (extrapolated)  characteristics  of  this 
technology  for  aims  of  optical  computing  and  optoelectronic  associative  memory 
systems  (with  estimated  capability  ~  1014  switch./sec  for  one  chip  device  similar  to 
shown  in  Fig.  2). 


Fig.l.  Single  channel  planar  AO  cell  Fig.  2.  Scheme  of  multifrc 

based  on  Ti:IiNb03  with  collinear  quency/ multichannel  planar  AO  cell 

wave  interaction.  for  associative  memory  system. 
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ABSTRACT 

The  use  of  a  bi-cell  photodiode  detector  in  an  acousto-optic  FM  demodulator  is  described.  Theory  is 
developed  showing  that  the  system  is  capable  of  up  to  57  dB  AM  rejection  and  practical  results  are 
presented  to  support  die  theory. 


SUMMARY 

The  acousto-optic  FM  demodulator  [1,2]  is  a  conceptually  simple  and  yet  highly  sensitive  system  for 
real-time  demodulation  of  narrow  band  FM  signals  at  carrier  frequencies  up  to  around  2  GHz.  The 
basic  system  is  shown  in  Figure  1.  The  system  depends  upon  the  frequency  dependent  deflection  (60) 
of  a  laser  beam  (wavelength  X)  by  an  acoustooptic  cell  (acoustic  velocity  Wg),  given  by 

66  =  ^-fif  (1) 

where  5f  is  the  change  in  frequency. 

In  the  original  system  a  angle  detector  is  used  and  the  deflected  spot  of  light  scans  across  a  knife 
edge  so  that,  as  die  frequency  changes,  so  does  the  amount  of  light  reaching  the  detector.  This 
system  is  highly  sensitive  to  any  AM  content  in  the  signal  and  also  to  any  variations  in  the  intensity 
of  the  laser.  The  AM  content  can  be  removed  by  hard  limiting  but  the  laser  intensity  noise  remains  a 
problem. 


modulated 
signal  In 


Figure  1.  The  Acousto-Optic  FM  Demodulator  -  Schematic 
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In  the  mw  system  the  knife-edge/single  detector  assembly  is  replaced  by  a  bi-cell  detector 
comprising  two  photodiodes  with  a  very  small  inactive  strip  between  them.  This  strip  is 
perpendicular  to  the  direction  of  beam  deflection  so  that  the  edge  of  each  diode  acts  as  a  knife  edge. 
For  small  deviations  of  the  gaussian  beam  ( :£  half  the  standard  deviation)  the  optical  power  reaching 
the  two  detectors  is 

P+  -  |-(1+K)  and  P_  =  |-(1-K)  (2) 

where  Pj  is  the  power  in  the  first  order  diffracted  beam  and  K  is  the  ratio  of  the  signal  power  (Pg) 
due  to  due  movement  of  the  beam  to  the  background  power  (Pj/2).  It  should  be  noted  that  it  is  the 
quantity  K  that  gives  the  frequency  modulation  since  it  may  be  shown  that 

K  =  Jf  (3, 

’l 

where  d  is  the  1/e^  laser  beam  width  at  the  AO  cell. 


Taking  the  difference  of  P+  and  P.  we  obtain  an  output  proportional  to  PiK.  Since,  for  linear 
response,  K  <  <  1  we  can  see  that  variations  in  the  diffracted  optical  power  Pj  have  a  far  larger 
effect  upon  the  output  of  the  single  detector  system  (P+)  than  on  the  output  of  die  bi-cell  system. 
The  AM  rejection  of  the  bi-cell  system  relative  to  the  single  detector  system  may  be  shown  to  be 


1 +K 

2K  *  2K 


for  K  <  <  1  (4) 


Using  d  =  O.S  mm  and  Va  =  617  m/s,  we  find  that  the  AM  rejection  of  the  bi-cell  system  is 


57  -  10  log(£f)  dB 


(5) 


Clearly  the  AM  rejection  is  very  good  for  small  values  of  dt  Even  at  the  limit  of  the  linear  range  of 
this  system  (5f  =  200  kHz),  the  AM  rejection  is  4  dB.  Theoretical  predictions  of  this  model  are 
presented  and  compared  with  experimental  results  ami  are  found  to  be  in  good  agreement.  Although 
the  performance  of  the  system  falls  off  as  the  frequency  deviation  increases,  its  main  advantage  lies 
in  its  simplicity.  The  AM  effects  discussed  arise  from  both  the  AM  content  of  the  signal  and  any 
laser  intensity  variations.  If  this  technique  is  combined  with  a  hard  limiter  on  the  input  signal  the 
resulting  system  performance  is  found  to  be  significantly  better  than  the  knife  edge/single  detector 
system.  An  alternative  technique,  which  would  completely  remove  the  AM  content,  is  to  split  the 
diffracted  beam.  One  portion  goes  to  die  knife  edge/ single  detector  and  foe  other  to  a  detector  which 
measures  only  the  AM  content.  Dividing  foe  first  signal  by  half  foe  second  yields  the  FM  signal. 
However  with  currently  available  AO  cells  the  system  is  capable  of  operating  at  modulation 
bandwidfos  up  to  10  MHz;  at  these  frequencies  analogue  dividers  do  not  perform  well. 
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Abstract 

The  performance  of  an  8x8  polarization-independent 
LiNb03  switch  matrix  formed  by  directional  couplers  for 
the  local  area  network  is  described.  The  cross-talk  of  the 
coupler  is  precisely  estimated  and  its  minimization  is 
discussed. 


We  report  the  performance  of  a  8x8  polarization-independent 
LiNb03  switch  matrix  on  the  base  of  a  directional  coupler  with  its 
full  complement  of  the  local  area  network.  The  objective  of  this 
work  was  to  provide  an  Mg0:Ti:LiNb03  switch  array  on  the  base  of 
a  new  type  directional  coupler  for  use  as  a  photonic  center  stage 
switch  in  the  architecture  switching  network. 

In  designing  such  schemes  the  problem  of  cross-talk 
minimization  holds.  In  the  given  work  a  dependence  of  the  switch 
cross-talk  on  the  adjusting  section  parameters  is  analyzed.  An 
adjusting  section  scheme  which  permits  to  eliminate  the  cross-talk 
is  proposed.  We  investigate  the  device  on  the  basis  of  diffused 
channel  waveguides  where  modes  of  the  waveguides  are  described 
approximately  by  the  scalar  equation 


dV +aV+aV  + 

Ac2  3 fy2  dz2 


kQWxyj)  *0 


where  V  denotes  the  light  amplitude  in  the  waveguide  and  ko  the 
wave  number.  This  equation  is  solved  for  y<0  with  the  boundary 
condition  Vty-0-0. 

The  optical  modulators  used  in  the  experiment  are  high-speed 
waveguide  electro-optic  directional  couplers.  The  fabrication 
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parameters  for  the  single-mode  waveguides:  titanium  stripe  width 
4pm  and  coupler  length  L-8mm.  For  this  device  more  than  80% 
efficiently  was  obtained  (wavelength  is  1.3pm  and  12V  switching 
voltage,  -29dB  "bar"  state  crosstalk  and  -19dB  "cross"  state 
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ABSTRACT 

Various  near  field  patterns  of  a  photorefractive  waveguide  fabricated  in  a  lithium 
niobate  crystal  are  compared  with  those  of  numerical  analysis  using  a  waveguide 
model  to  evaluate  the  optical  characteristic  of  photorefractive  waveguides. 

SUMMARY 

We  present  in  this  paper  experimental  and  numerical  analysis  of  a  novel 
approach  to  optical  interconnections  for  neural  networks.  The  approach  is  to 
fabricate  waveguide  structures  in  photorefractive  media  by  simply  focusing  and 
scanning  a  laser  beam  (see  Fig.  1).  Resultant  waveguides  with  variable  index 
profiles  may  be  used  for  the  optical  dynamic  interconnections.  We  believe  in  the 
possibility  that  high-density  optical  interconnections  with  a  self-organization 
ability  be  realized  by  this  approach.  Experimental  evaluation  of  simple  photore¬ 
fractive  waveguides  has  been  made1'2*.  In  this  paper,  optical  characteristic  of  a 
photorefractive  waveguide  in  a  lithium  niobate  (LN)  crystal  is  studied  experi¬ 
mentally  and  numerically.  Comparison  between  the  results  of  the  experiment 
and  numerical  analysis  showed4*  that  the  maximum  photorefractive  change  of 
refractive  index  is  as  large  as  lxlO"3-  Results  of  modal  analysis  will  also  be  pre¬ 
sented. 

The  experimental  setup  is  briefly  outlined.  A  linearly  polarized  Ar-ion 
laser  beam  whose  electric  field  vector  and  propagation  direction  are  perpen¬ 
dicular  to  the  c-axis  of  a  LN  crystal  is  focused  by  a  microscope  objective  lens  to 
fabricate  the  waveguide  structures  in  the  LN  crystal.  A  He-Ne  laser  beam  whose 
electric  field  vector  is  polarized  along  the  c-axis  is  also  provided.  This  beam 
causes  a  negligible  photorefractive  effect  and  is  used  to  excite  the  guided  modes 
in  the  fabricated  waveguides.  The  longitudinal  position  of  the  focus  of  the  fabri¬ 
cation  beam  can  be  controlled  by  a  translator.  A  microscope  objective  is  placed 
behind  the  LN  crystal  for  observation  of  near  field  patterns  on  the  rear  face  of  the 
crystal.  The  near  field  patterns  are  observed  by  a  CCD  image  sensor. 

We  observed  the  lateral  profile  of  refractive  index  change  around  the 
waveguide1'2*.  To  evaluate  the  refractive  index  change  within  the  crystal,  we 
have  mad »  a  model  of  photorefractive  index  distribution  within  the  crystal.  In 
this  model,  the  maximum  refractive  index  change  is  an  unknown  parameter. 
We  observed  also  the  near  field  patterns  across  the  rear  face  of  the  crystal1'2*.  We 
have  calculated  the  variation  of  intensity  distribution  of  guided  beams  along  the 
waveguide  (Figs.  3  and  4)  and  near  field  patterns  across  the  rear  face  of  the  crystal. 
Comparison  between  the  experimental  and  numerical  results  made  us  possible 
to  estimate  the  maximum  value  of  refractive  index  change4*. 

In  this  paper,  we  compare  the  results  of  a  more-detailed  experiment  with 
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those  of  numerical  analysis  to  estimate  precisely  the  refractive  index  distribution 
within  the  crystal  by  using  more  realistic  model  for  the  refractive  index  change. 
We  also  observe  the  variation  of  near  field  pattern  when  the  amplitude  distribu¬ 
tion  of  the  excitation  beam  across  the  entrance  of  the  waveguide  is  changed. 
Such  observation  gives  us  the  information  about  the  propagation  characteristic 
of  the  waveguide.  Comparison  of  these  observations  with  the  numerical  results 
will  improve  the  accuracy  of  estimation  of  index  distribution  and  optical  charac¬ 
teristics  of  the  waveguide.  Examples  of  the  side  views  of  the  calculated  intensity 
distribution  along  a  curved  waveguide  are  shown  in  Figs.  3  and  4.  The  maxi¬ 
mum  values  for  the  refractive  index  change  are  assumed  to  be  1. 2x10" 3  and 
0.9X10*3,  respectively. 


0  50  100  150  200 


Objective  Lens  waveguide  Position  along  the  C-axis  [  nm  ] 


Fig.  1  Fabrication  of  3-D  Fig.  2  Near  field  pattern  across 

photorefractive  waveguide.  the  rear  face  of  the  crystal. 


MP58/143 


FManhctiw  Ckaaad  Waveguide  Stractares  for  Optical  Coapatiag  AppUcatioas 

Vladymir  M.  Shandarov 
Laboratory  of  Acoustooptoelectronics 
State  Academy  of  Control  Systems  and  Radioelectronics 
40  Lenin  Ave.,  Tomsk  634050,  Russia 
Phone:  007-<3822)-496278,  Fax:  007-(3822)-223262 

Abstract 

Characteristics  of  channel  optical  waveguide  structures,  formed  in  lithium  niobate  by 
photorefiractive  impurity  diffusion,  have  been  investigated,  and  two  •  dimension  matrix  of 
such  waveguides  for  optical  computing  devices  has  been  proposed. 


Besides  bulk  photorefractive  crystals,  recently  some  investigations  of  photorefiractive 
effects  in  fiber  -  shape  samples  and  planar  waveguides  on  the  base  of  such  crystals  have  been 
performed  [1  -  3].  They  showed  some  advantages  of  similar  fiber  and  waveguide  elements 
compared  with  bulk  materials  because  of  the  high  optical  energy  density  within  them  and 
easier  changing  of  their  photorefractive  properties.  It  resulted,  in  particular,  in  proposal  to 
use  the  photorefiractive  fiber  bundles  as  the  two  -dimension  matrix  in  optical  storage  devices 
and  pixel  -  by  -  pixel  image  processors  [1].  Because  channel  optical  waveguides  and  optical 
fibers  have  many  similar  features,  the  photorefractive  channel  waveguide  structures  can  be 
used  in  such  a  manner  as  well.  In  this  work  some  technological  features  and  photorefiractive 
properties  for  lithium  niobate  single  -  mode  and  multi  -  mode  channels  were  investigated 
and  the  one  of  possible  channel  structure  configurations  for  optical  computing  has  been 
proposed. 

1.  Fabrication  of  dm— H  mmpides. 

The  single  channels  and  channel  arrays  were  formed  both  on  the  Y  and  Z  cut 
LJNb03  substrates.  To  increase  the  photorefractive  sensitivity  of  LiNb03,  the  Fe  diffusion 
and  the  sequential  II  and  Go  (or  Fe  and  Co)  diffusion  were  used  (because  Co  diffusion 
only  results  in  a  negative  change  of  UNb03  refraction  index).  To  provide  more 
symmetric  depth  profile  of  channels,  some  of  them  were  formed  by  the  counterdirected  Fe 
diffusion  in  the  substrates  with  thickness  0.1  mm.  The  initial  film  thicknesses  of  doping 
impurity  films  on  the  substrate  surface  made  up  20,0  -  30,0  nm  for  single  -  mode  and  up 
to  200,0  nm  for  multi  -  mode  channels.  The  diffusion  process  was  carried  out  in  the  air 
atmosphere  at  900  -  1000°  C  for  time  period  from  2  up  to  40  hours.  The  width  of  metal 
strips  was  changed  within  the  range  from  5,0  mcm  up  to  0,1  mm  for  single  -  mode  and 
multimode  waveguides,  respectively.  To  use  samples  obtained  in  experiments,  their  input 
and  output  end  faces  were  polished  with  optical  quality.  As  waveguide,  as 
photorefractive  characteristics  of  channel  structures  were  tested  at  the  light  wavelenght  633 
nm  (He  -  Ne  laser). 

2.  Ckaaad  deaeat  characteristics. 

The  waveguide  parameters  of  channels  formed  were  determined  from  modal 
characteristics  inspected  by  a  prism  coupling  method  for  multi  -  mode  elements  and  from 
near  field  distributions  for  single  -  mode  ones.  Their  photorefractive  characteristics 
were  investigated  by  hologram  recording  in  the  common  two  -  beam  setup  (Fig.l). 

Characteristics  of  waveguides  produced  considerably  depended  on  doping 
impurity  kind  and  channel  waveguide  geometric  parameters.  Thus,  the  best  result 
(combination  of  optical  quality,  photorefractive  sensitivity  and  time  response 
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characteristics)  for  single  -  mode  channels  was  obtained  for  only  Fe  diffusion.  The 
refractive  index  change  at  the  waveguide  boundary  was  measured  in  this  case  0.005  - 
0.007  and  the  waveguide  depth  was  dose  to  its  width  value  5,0  mem.  The  typical  time 
response  at  photorefractive  grating  recording  and  erasure  made  up  about  1  s  for  such 
waveguides.  The  main  feature  of  multi  -  mode  LiNh03:Fe  waveguides  is  the  inhomogeneity 
of  Fe  distribution  in  the  channel  cross  section,  that  results  in  the  strong  dependence  of 
photorefractive  grating  recording  and  erasure  characteristics  on  the  input  light  field  space 
structure.  These  characteristics  were  improved  in  waveguides  produced  by  the 
counterdirected  Fe  diffusion  in  0,1  mm  Z  cut  substrates.  The  Fe  films  with 
thicknesses  from  100,0  up  to  200,0  nm  were  used  in  this  case.  It  should  be  noted  the 
different  conditions  for  light  definition  in  such  waveguides  in  the  horisontal  and  vertical 
directions,  that  has  to  be  taken  into  attention  for  image  processing  in  such  elements. 

3.  Applications  of  channel  photorefractive  wavegaide  stractares  in  optical  comparing. 

Like  the  fiber  photorefractive  elements,  the  channel  structures  discussed  have  some 
advantages  compared  with  bulk  crystals.  They  allow  nonlinear  interactions  of  light  beams 
at  low  optical  powers,  easy  to  control  the  photorefractive  characteristics  of  separate 
elements  in  the  whole  structure  and  their  spatial  configuration.  Besides,  many  wafers 
with  channel  arrays  can  be  stacked  in  the  2D  matrix  for  optical  storage  devices  and  other 
processors  (Fig.2)  with  possibility  to  provide  any  required  dependence  of  photorefractive 
properties  in  the  cross  section  of  such  an  artificial  "bulk  photorefractive  cell". 


Fig.  1  Experimental  setup  for  inspections 
of  photorefractive  characteristics  in 
channel  waveguides 


Fig.2.  Schematic  of  20 

matrix  from  channel 
waveguide  arrays 
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Abstract. 

The  method  of  structural  organization  of  fiber-optical  memory 
device  (FOMD)  is  proposed  which  allows  effective  solutions  to  the  problems 
of  data  addressing,  control  and  correction  taking  the  peculiarities  of 
FOMD' s  into  account.  By  this  purpose  two  contrary-directed  circulation 
channels  -  main  and  built-in  additional  ones  -  are  organized  in  a  single¬ 
fiber  FOMD  loop. 


A  number  of  features  peculiar  to  fiber-optical  memory  devices1’? 
namely,  high  signal  circulation  rate,  incontrollability  of  signals  in  op¬ 
tical  channel,  dependence  of  their  time  position  on  various  destabilizing 
factors,  hamper  the  use  of  conventional  methods  of  data  synchronization, 
addressing  and  control  employed  in  other  memory  types. 

This  paper  presents  a  method  of  structural  organization  of  fiber- 
optical  memory  loop  which  allows  effective  solutions  to  the  problems  of 
data  addressing,  control  and  correction  taking  the  peculiarities  of  FOMD’s 
into  account. 


Figure  1.  PD  -  photodetector,  M  -  modulator,  L  -  laser,  IOU  -  input/output 
unit,  FC  -  fiber  coupler,  CG  -  clock  generator,  A F  -  address  former,  DC  - 
digital  comparator. 
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In  accordance  with  the  Method  proposed,  two  contrary-directed  circu¬ 
lation  channels  -  sain  and  additional  ones  -  are  organized  in  a  single- 
fiber  FOMD  loop  by  using  fiber  couplers. Inf ormat ion  signal  sequence  circu¬ 
lates  in  the  sain  channel,  while  the  additional  channel  can  be  used  to 
transmit  various  auxiliary  signals  (verifying,  synchronizing  and  addres¬ 
sing  signals). 

To  ensure  a  reliable  access  to  any  segment  of  circulating  information 
sequence  and  to  improve  its  storage  security,  the  additional  channel  can 
be  used  as  an  addressing  channel.  In  the  writing  mode  (see  figure  1),  the 
Information  sequence  {b#i>  and  the  addressing  sequence  {c^>  are  written 

into  the  loop  memory  be  feeding  them  synchronously  to  the  information 
(inf. input)  and  auxiliary  (aux. input)  input  terminals  of  the  Input/output 
unit  (IOU),  respectively,  and  further  to  control  input  terminals  of  the 
corresponding  electrooptical  modulators.  As  the  addressing  sequence  cir¬ 
culates  in  closed  loop,  its  segments  (address  words)  are  also  synchronous¬ 
ly  written  into  an  n-bit  address  former  (AF)  thereby  forming  the  address 
of  a  coming  information  sequence.  Synchronous  circulation  of  signal  flows 
along  identical  channels  allows  simultaneous  feeding  of  the  IOU  input 
terminals  independent  of  any  external  influences. 

Access  to  the  required  information  signal  (rewriting,  retrieval)  is 
realized  through  setting  appropriate  address  on  the  address  inputs  of 
a  digital  comparator  (DC).  When  the  contents  of  the  AF  coincides  with  the 
set  address  during  the  next  passage  of  the  address  sequence  along  the 
circulation  loop,  the  DC  generates  a  control  signal  used  to  strobe 
Information  signals. 

In  order  to  reduce  error  probability  in  forming  address  words,  it  is 
advantageous  to  use  a  pseudo-random  sequence  (PRS)  as  address  sequence. The 

PRS  length  is  K  *  2*  -1,  where  M  is  the  PRS-generator  bit  capacity,  and 
is  determined  by  the  memory  information  capacity;  the  address  former 
represents  an  n-blt  shift  register,  where  n  is  not  less  than  N.  In  case  a 
redundant  length  of  the  PRS  address  words  is  used,  i.e.  for  n  >  N, 
addressing  is  free  of  errors  even  if  the  received  address  word  has  several 
incorrect  ("spoilt")  bits. 

Transmission  of  verification  and  synchro  signals  in  the  additional 
channel  results  in  enhanced  reliability  of  data  storage  owing  to 
correction  of  possible  errors  and  elimination  of  mistiming  in  synchronous 
detection  of  signals3.  In  addition,  by  using  non-linear  optical  effects  it 
is  possible  to  realize  operating  mode  ensuring  compensation  of  attenuation 
of  information  signals  in  the  optical  fiber. 
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Basic  properties  and  possible  applications  to  optical 
computing  of  new  types  of  spatial  and  spat lot emporal 
solitons  and  their  bound  structures  are  considered  in  the 
following  nonlinear  optical  schemes:  (a)  medium  with 
nonlinearity  of  refractive  index;  (b)  wide-aperture 
nonlinear  interferometer;  (c)  wide-aperture  laser  with 
saturable  absorption;  (d)  optical  fiber  with  sections  of 
gain  and  absorbing  media. 

Soliton-like  light  structures  of  different  types 
(spatial  and  spatiotemporal  optical  solitons)  can  be  formed 
in  a  wide  spectrum  of  nonlinear  optical  systems:  passive  and 
active,  with  and  without  feedback.  A  number  of  their 
properties  promising  for  optical  computing  differ  noticeably 
from  the  case  of  “traditional  solitons"  observed,  for 
instance,  in  single-mode  fibers  with  Kerr  nonlinearity.  In 
the  present  communication  we  present  results  of  theoretical 
investigations  and  computer  simulations  of  basic  properties 
of  soliton-like  structures  in  four  different  nonlinear 
optical  systems  and  consider  their  possible  applications  for 
processing  of  optical  information  and  optical  computing. 

In  transparent  medium  with  saturable  nonlinearity  of 
the  refractive  index,  formation  of  different  kinds  of 
spatial  solitons  is  possible,  including  solitons  with 
different  topological  charges  (vortices).  Single  spatial 
solitons  are  characterized  by  azimuthal  (topological  charge) 
and  radial  integer  indexes,  and  by  the  propagation  constant, 
which  can  vary  continuously  in  some  interval.  As  a  result, 
there  is  a  set  of  solitons  with  continuous  variation  of 
their  characteristics  (e.g.,  maximum  intensity) .  In  the 
case  of  copropagation  of  a  pair  of  solitons,  depending  on 
characteristics  of  single  solitons,  initial  distance  between 
them  (in  the  transverse  direction),  angle  between  axes  of 
propagation  and  phase  difference,  the  following  regimes  can 
be  realized:  oscillations  of  the  transverse  distance  between 
solitons  with  increase  of  the  longitudinal  coordinate, 
elastic  and  inelastic  collision  (with  transfer  of  power), 
decay  of  solitons  into  fragments;  merging  of  solitons  into 
one  with  larger  power.  Dynamics  of  interaction  of  laser 
beams,  initial  profile  of  which  differs  from  a  spatial 
soliton,  depends  on  the  degree  of  nonlinear  distortions  of 
these  beams  before  their  collision  in  the  medium. 

In  passive  wide-aperture  nonlinear  interferometers 
excited  by  homogeneous  coherent  external  radiation,  a  wide 
set  of  spatial  solitons  (diffractive  autosolitons)  and  their 
bound  states  can  exist.  In  contrast  to  "traditional 
solitons"  which  have  continuous  spectrum  of  widths  and 
amplitudes,  diffractive  autosolitons  have  discrete  set  of 
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widths  and  a  certain  value  of  amplitudes.  Bound  states  of 
diffractive  autosolitons  are  also  characterised  by  a 
discrete  set  of  distances  between  them.  Transverse  (with 
respect  to  the  direction  of  the  wave  vector  of  radiation) 
motion  of  solitons  and  their  symmetric  structures  is 
possible  only  tfith  oblique  incidence  of  radiation  on  the 
interferometer.  The  property  of  asymmetric  structures  of 
bound  diffractive  autosolitons  to  propagate  along  the 
interferometer  aperture  even  with  normal  incidence  of 
radiation  on  the  interferometer  gives  wide  opportunities  to 
use  these  structures  in  transverse  transmission  of 
information  and  performing  different  arithmetic  operations. 

In  a  wide-aperture  laser  with  hard  excitation  of 
lasing,  soliton-like  structures  -  laser  autosolitons  -  can 
be  excited.  Their  type  is  intermediate  with  respect  to  the 
two  cases  considered  above.  Really,  the  presence  of  the 
characteristic  level  of  intensity  (the  intensity  of 
stationary  transversely  homogeneous  lasing)  prohibits 
arbitrary  change  of  maximum  autosoliton  intensity.  This 
makes  the  properties  of  the  laser  autosolitons  close  to  the 
properties  of  the  autosolitons  in  passive  interferometers. 
However,  the  absence  of  external  radiation  injection  allows 
excitation  of  single  autosolitons  moving  in  the  transverse 
direction.  Another  specific  feature  of  the  laser 
autosolitons  is  the  shift  of  lasing  frequency  caused  by 
diffractive  and  nonlinear  effects.  Characteristics  of  single 
laser  autosolitons  and  dynamics  of  their  interaction  are 
described. 

In  single-mode  optical  fibers  in  the  range  of  anomalous 
dispersion  with  quaslperiodic  location  of  elements  with 
saturable  gain  and  absorption,  the  temporal  soliton-like 
structures  similar  to  the  laser  autosolitons  can  be  formed. 
With  a  certain  choice  of  parameters,  cw- radiation  with  low 
intensity  is  damped  out  (thus  weak  noises  are  suppressed), 
and  radiation  with  high  intensity  is  stabilized  at  a  certain 
level.  Analogs  of  switching  waves  and  laser  autosolitons 
can  be  realized  for  pulsed  radiation.  Pulses  of  fixed 
self- sustained  form,  with  certain  value  of  peak  intensity 
and  pulse  width,  can  be  transmitted  in  this  fiber  system. 

The  types  of  soliton-like  structures  considered  above 
have  potential  interest  in  the  problems  of  optical 
information  processing  and  optical  computing.  Spatial 
solitons  in  transparent  medium  with  saturable  nonlinearity 
of  the  refractive  index  can  be  used  for  optical  switching 
and  performing  a  number  of  other  elementary  operations. 
Diffractive  autosolitons  in  wide-aperture  nonlinear 
interferometers  allow  to  perform  a  number  of  operations 
within  the  discrete-analogous  method  of  optical  computing, 
e.g.  in  shift  registers  or  multi-digit  full  adders.  Laser 
autosolitons  can  be  used  in  the  schemes  of  information 
storage.  Soliton-like  pulses  in  single-mode  optical  fibers 
with  elements  with  saturable  gain  and  absorption  are 
characterized  by  high  contrast,  high  signal- to-noise  ratio, 
and  stability  of  power  characteristics,  thus  being  promising 
for  information  transmission. 


■"”1 
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We  propon  and  analyse  an  alt.amat.iva  approach  fnr 
conatmotinf  a  parallel  ali-optioal  data  transmission  lino. 
Control  laser  beam*  tun  form  Intermedia to  channels  in  which 
pulses  of  information  could  bs  guidod  without  a  aroaa-Lalk. 


Tha  aolitona  in  optical  fibers  are  proposed  as  highly 
suitable  carriers  of  information  Onf ortunately ,  the  ohangee 
of  tho  femtosecond  soliton  oharaoteriatioa  at  lone  propagation 
diotanoes  may  limit  tliaix  use.  In  this  work  we  propose  an 
altornative  approach  for  constructing  a  porallol  all- optical 
data  transmission  lina.  Ctt  control  beep#  copropagate  in  a 
planar  optical  waveguide  and  fora  intermediate  nonlinear 
channels  in  which  pulses  of  Information  oould  be  guided 
(Fig. 1 ) -  To  ensure  a  reasonably  extended  transmission 
distance ,  the  control  beams  should  propagate  in  the  form  of 
non interacting  fundamental  bright  spatial  soli Lous.  The  signal 
waves  experience  the  influence  of  the  uuutrol  beams  only.  This 
induced  action  originates  in  the  spatial  refractive  index 
change  a long/ across  the  nonlinear  medium  caused  by  the  control 

hnnan 


Fig.  1 
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Th*  evolution  of  the  control-  and  tnn  mgn«i 
(information)-  wavt*  in  ann.lyt.od  usln<  the  aplit-step  Courier 
method.  For  5  control  beams  at  oquel  radii  a  and  intensities 
we  ohUiliad  ••table  propagation  at  a  baas-to-baM  distance 
•^x=l.75a.  F  Lg .  2  represents  the  spatial  evolution  of  the  signal 
pulses  in  such  a  situation.  The  initial  spatial  profiles  of 
the  signal  waves  are  assumed  to  ho  Gauaaian.  Periodic  spatial 
oscillations  of  ths  signals  along  the  nonlinear  medium  without 
•  arose- talk  between  the  information  channels  are  clearly 
expressed . 

tn  ths  configuration  proposed  s  stable  spatial 
guiding  of  the  information  pulses  should  be  sxpectod  even  at 
control- beam  intensity  fluctuations  within  10%.  Tho  uignal 
waves  behavior  is  almost  tho  name  as  under  perfect  initial 
conditions  (Fig. 2).  Fig. 3  shows  the  spatial  evolution  of  tho 
information  pulses  when  they  are  not  per  foot ly  aligned  with 
respect  to  the  centers  of  ths  nonlinear  channels  (  maximum 
initial  deviation  within  25*  the  channel  half-width) .  The 
nonlinear  waveguide  induced  by  the  control  beams  "reflects” 
back  the  signal  wave  preventing  the  inf ormat ion -channel 
cress- talk. 


Fig.  2 


Fig.  3 


The  performance  of  the  parallel  all-optica)  data 
transmission  line  proposed  seems  euuuuragiag.  Further  analyses 
are  under  ear  in  order  to  evaluate  the  restrict  ion  on  the 
Maximum  hit-rate  of  the  parallel  data  transmission  soheme 
proposed. 

This  work  was  done  with  the  financial  support  of  the 
National  Science  Foundation,  Bulgarin,  under  a  contract  F-206. 
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All-fiber  optical  schemes  are  promising  for  high-speed 
information  processing.  Due  to  fiber  broadbanding  their  clock 

frequencies  in  these  schemes  may  achieve  to  the  value  of  f_~ 

12  .  1 
10  Hz  and  more.  In  real,  however,  operation  rate 

restrictions  appear  due  to  insufficient  switching  rate  of 

logical  elements.  As  important  branch  to  win  through  the 

restrictions  is  the  transition  to  speed-of-light  latchless 

architecture  [1,2].  Fiber  elements  serve  here  both  for 

information  transformation  and  for  its  storage.  Particularly, 

operations  of  switching  may  be  realized  on  the  base  of 

nonlinear  directional  couplers  or  fibers  with  nonlinear  ellipse 

rotation,  where  speed  non-resonance  nonlinearities  are  used. 

Functional  abilities  of  schemes  based  on  speed-of-light 

latchless  architecture  essentially  depend  on  the  number  of 

algorithms  realized  in  them. 

In  the  study  reported  as  an  example  of  adding  operation  it 
is  shown  that  symbolic  substitution  algorithm  may  be  realized 
on  the  base  of  speed-of-light  latchless  architecture  approach. 

The  scheme  of  adder  is  presented  at  the  figure  1. 

For  number  representation  two  orthogonally  polarized  modes  of  A 
and  B  fibers  are  used.  At  the  fiber  inputs  clock  pulses  from 
laser  power  source  are  entered  with  polarization, 
corresponding,  for  example,  to  zero.  To  switch  the  polarization 
to  "one"  state  the  control  signals  from  feedback  line  or  input 
block  are  used.  The  switching  is  fulfilled  by  means  of 
nonlinear  ellipse  rotation. 

Then  output  from  the  fibers  signals  are  multiplied  in  couplers 
and  are  entered  to  the  interconnection  block  input.  A  signal  of 
a  high  level  is  being  formed  at  the  one  of  four  outputs  of  this 
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block  depended  on  input  signals  combination  and  is  entered  then 
to  a  feedback  line  in  accordance  with  symbolic  substitution 
algorithm.  A  portion  of  energy  of  signals  is  directed  to 
output.  In  a  feedback  line  additional  one-bit  time  delay  is 
fulfilled  and  owing  to  this  carry  operation  is  realized.  After 
m+l-th  cycle,  where  m  is  digit  capacity,  the  result  of 
calculation  is  formed  at  the  output.  One  can  easily  examine 
that  under  condition  of  signals  relative  coherency  in 
interconnection  block  crosstalk  level  in  the  feedback  line  and 
at  the  output  is  about  25%  of  a  valid  signal.  The  processing 


power  of  the  adder  is  determined  as  v* 


(m+1)4 


At 


v  is  equal  to  6.  25*10  operations/ sec. 
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Fig.l.  The  all-fiber  adder  based  on  simbolic  substitution 
algorithm  scheme. 
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Abstract 


We  propose  an  optical  logic  processor  using  wavelength 
division  multiplexing  in  a  single  optical  fiber. 


During  the  past  decade,  many  attempts  have  been  made  to 
realize  optical  parallel  binary  and  multiple-valued  logic  processing 
based  on  digital  logic  theory  -  see  for  example  [1-4].  In  this  paper 
we  propose  a  novel  optical  method  based  on  wavelength  division 
multiplexing  in  a  single  optical  fiber  to  perform  parallel  binary 
logic.  Our  system  essentially  operates  very  much  like  a  single-hop 
local  lightwave  network.  This  clearly  needs  wavelength  -  agile 
optical  transceivers  for  the  purpose  of  dynamic  coordination,  and  an 
efficient  protocol;  we  propose  the  Receiver  Collision  Avoidance 
Protocol.  The  method  Involves  coding  parallel  pairs  of  data  channels 
with  a  *0*  or  a  *1’  in  packets  in  an  N  data  channel  system  with  one 
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control  channel  which  la  encoded  with  a  streaa  of  *0'  (for  ‘off’ )  and 
'1’  (for  *on' )  to  natch  the  nuaber  of  pairs  of  data  channels  used. 
The  encoded  control  channel  would  then  alert  the  receivers  (detectors) 
operating  in  pairs  to  manipulate  the  output  for  corresponding  pairs  of 
inputs.  In  Table  1  and  Table  2  below  we  illustrate  respectively  the 
AND  function  and  XOR  function  involving  Just  a  pair  of  data  channels 
for  true  logic.  Note  that  identical  pairs  of  segments  in  the  control 
channel  are  such  that  both  on  (off)  means  output  is  1  (0). 


A 

B 

1st  Segment  of 
Control  Channel 

2nd  Segment  of 
Control  Channel 

Output 

0 

sn 

off 

off 

0 

0 

SB 

on 

off 

0 

1 

off 

on 

0 

1 

on 

on 

1 

Table  1 


A 

B 

1st  Segment  of 
Control  Channel 

2nd  Segment  of 
Control  Channel 

Output 

0 

0 

off 

off 

0 

0 

1 

off 

on 

1 

1 

0 

on 

off 

1 

1 

1 

off 

off 

0 
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ABSTRACT 

We  have  studied  die  possibility  of  realizing  all  optical  arithmetic  operations  using  spatial 
solitons  and  their  interaction  properties.  A  nonlinear  planar  waveguide  geometry  has  been 
considered. 


SUMMARY 

Our  scheme  is  based  on  a  couple  of  two  propagating  spatial  solitons  in  a  nonlinear  waveguide, 
under  the  hypotesis  of  transverse  confinement  realized  by  a  third  order  nonlinear  medium,  and 
where  the  relative  phase  is  properly  varied,  so  that  the  interaction  properties  of  solitons  can  be 
utilized  to  perform  the  required  process. 

Our  analysis  starts  from  an  adimensional  nonlinear  Schroedinger  equation1: 


1  d2u 

2  ax 2 


-lutV 


(1) 


Under  proper  conditions  the  propagation  is  described  from  the  following  system  of  coupled 
differential  equations2: 


qu  =-4exp(-2<7)cos(2<l>) 


«  4exp(-2q)sin(2d>)  (2) 

where  q=q(z)  is  the  relative  distance,  along  the  propagation  direction,  at  the  genetically  z 
coordinate,  and  $  is  the  relative  phase  of  solitons. 

In  Ref.  [3]  die  influence  is  studied  of  die  change  of  relative  phase  of  two  input  solitons,  of  equal 
input  intensity,  on  the  intensity  of  the  soliton  whose  phase  has  not  been  changed.  The  intensity 
is  found  to  vary  according  to  the  law: 

/,  =  *,<&  (3) 

where  it,  is  a  constant  depending  on  initial  distance  with  respect  to  the  other  soliton. 

This  means  that  the  phase  information  carried  by  a  pulse  can  be  transferred  to  the  intensity  of 
another  pulse  after  propagation  (intensity  modulation  by  change  in  phase  ). 

By  the  help  of  this  property  all  arithmetic  operations  can  be  realized.  For  example  die  adder 
can  be  realized  when  two  soliton  beams  enter  in  die  same  waveguide  with  a  relative  phase 
equal  to  zero  and  their  interaction  in  the  first  part  of  propagation  is  avoided  by  means  of  a 
proper  shield.  A  beam  of  intensity  /,  propagated  normally  to  the  soliton  path,  changes  via  a 
nonlinear  interaction  die  refractive  index  of  a  lenght  L  of  the  soliton  path  (see  Fig.1),  inducing 

a  phase-change  equal  to  k%Uv  A  second  beam  of  intensity  I2  interacts  with  the  same  soliton 
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for  the  same  length  L  inducing  a  further  phase  change  equal  to  k^Ut.  The  total  phase  change 

is  ft#L(/,  +/a).  If  now  we  let  die  two  solitons  to  interact,  they  interfere  according  to  the 

intensity  modulation  effect,  and  die  intensity  of  one  sditon  becomes  /0  =  +/2),  that  is 

proportional  to  the  sum  of  intensities  of  input  beams,  realizing  an  optical  addition. 

In  a  similar  way  it  is  possible  to  realise  an  optical  subtraction.  In  fact  if  we  consider  die  same 
structure  of  the  adder  where  the  zone  interested  by  die  second  beam  l2  is  composed  by  a 
negative  nonlinear  refractive  index  coefficient,  the  phase  change  induced  over  die  soli  ton  beam 

is  equal  to  -k^U2.  The  intensity  of  sditon  after  interaction  becomes:  /„  =  ktk^L{jt  - I2). 

The  division  is  made  in  a  quite  similar  way  by  using  only  a  beam  of  intensity  /,.  If  we  reduce 
the  length  L  by  a  factor  N,  die  phase  change  induced  over  the  sditon  is  equal  to  kJxL  I N ,  and 

the  output  intensity  becomes  /„  =  kfk^lyLlN .  This  process  is  quite  critical  since  it  depends  of 
our  capability  of  reducing  die  length  L  property. 

The  multiplication  is  made  by  increasing  N  times  the  length  L,  so  that  the  output  intensity 
becomes  /0  =  kik^LNIv 


BEAM  OF  BEAM  OF 
INTENSITY  I,  INTENSITY  I2 


OUTPUT  BEAM 
O*  INTENSITY  - 
l0-k.k,L(l1.I2) 


SHIELD 


IN  THIS  AREA  INTENSITY 
MODULATION  CAN  DEVELOP 


FigL  Scheme  of  the  optical  adder.  The  aoiiton  enter  with  the  same  phase. 
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Abstract 

Detection  of  candidate  object  regions  in  a  scene  is  step  one  in  general  scene  analysis.  New 
optical  morphological,  wavelet  and  Gabor  transform  filters  for  detection  are  presented  and 
fusion  is  used  to  reduce  false  alarms. 


Optical  image  processors  must  fulfill  a  variety  of  different  functions  if  they  are  to  find  wide¬ 
spread  use  in  scene  analysis,  pattern  recognition  and  product  inspection.  The  first  three  opera¬ 
tions  performed  in  general  image  analysis  are:  detection  (location  of  candidate  object  regions), 
reduction  of  false  alarms  and  image  enhancement 

We  consider  three  algorithms  for  detection:  morphological  wavelet  transforms  (MWTs), 
macro  Gabor  filters  (MGFs)  and  the  hit-miss  transform  (HMT).  These  are  shown  in  block  dia¬ 
gram  form  in  Figure  1.  All  algorithms  are  realizable  on  the  same  optical  correlator  architecture 
using  different  filter  functions  and  nonlinear  input  and  output  operations.  Each  algorithm  will 
be  highlighted  and  initial  results  shown. 


Detection  of  multiple  classes  of  objects  independent  of  3-D  distentions  and  object  contrasts 
in  the  presence  of  clutter  is  a  formidable  problem.  Thus,  we  expect  a  number  of  false  alarms  in 
these  detection  outputs.  To  reduce  these,  we  employ  algorithm  fusion  as  shown  conceptually  in 
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Figure  2.  The  concept  is  to  produce  all  three  output  detection  planes  and  to  AND  etc.  them. 
This  is  expected  to  reduce  false  alarms,  since  only  false  alarms  present  in  the  same  pixel  loca¬ 
tion  in  ail  three  outputs  will  be  present  in  the  final  fused  result.  Data  showing  a  significant 
reduction  in  false  alarms  will  be  presented. 


Once  candidate  regions  of  interest  have  been  located,  they  are  now  analyzed  further  to  iden¬ 
tify  the  object  present  in  each  region.  Prior  to  this,  image  enhancement  is  performed.  The  mor¬ 
phological  portion  of  the  MWT  algorithm  removes  large  clutter  regions,  the  wavelet  portion  of 
this  algorithm  removes  clutter  with  small  particle  sizes  and  produces  a  clutter  map.  We  also 
employ  morphological  operations  to  enhance  each  such  scene  region.  A  new  fast  and  very  effi¬ 
cient  optical  gray-scale  morphological  algorithm  and  architecture  are  used.  The  architecture 
(Figure  3)  is  quite  attractive  as  it  only  requires  binary  light  modulators.  This  algorithm  and 
architecture  will  be  described  and  initial  results  will  be  presented. 
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Abstract 

Signals  with  significant  overlap  in  both  the  space  and  frequency  domains  may  have  little  or  no 
overlap  in  a  fractional  Fourier  domain.  Spatial  filtering  in  these  domains  may  allow  os  to  eliminate 
distortion  components  which  cannot  be  eliminated  in  the  ordinary  Fourier  domain. 

1  Introduction 

Space-invariant  filtering  may  be  performed  by  multiplying  the  Fourier  transform  of  the  input  signal  by  the  Fourier  transform 
of  the  impulse  response.  Recently  we  have  discussed  how  various  space- variant  operations  can  be  performed  by  multiplying 
with  a  filter  function  in  a  fractional  Fourier  domain  [1].  These  operations  can  be  realised  optically,  because  the  fractional 
Fourier  transform  can  be  realised  optically.  One  approach  is  baaed  on  the  use  of  quadratic  graded  index  media  [2,  3], 
whereas  another  is  based  on  the  use  of  bulk  lenses  [4].  The  graded  index  approach  is  closely  connected  to  the  definition  of 
the  fractional  Fourier  transform  in  terms  of  its  spectral  decomposition,  whereas  the  built  implementation  is  closely  connected 
to  its  definition  in  terms  of  its  linear  transform  kernel  [1]. 

The  many  mathematical  properties  of  the  fractional  Fourier  transform,  its  relation  to  the  Wigner  space-frequency 
distribution,  wavelet  transforms,  and  chirp  basis  expansions,  its  applications  to  signal  processing,  and  issues  relating  to 
its  optical  implementation  are  discussed  in  the  references.  Due  to  limited  space,  we  will  here  content  ourselves  with  the 
presentation  of  two  examples  of  how  space-variant  filtering  can  be  achieved  by  applying  simple  binary  masks  in  fractional 
Fourier  domains.  Among  the  many  things  we  cannot  mention,  of  particular  interest  is  correlation  in  fractional  Fourier 
domains  and  its  application  to  pattern  recognition. 

2  Definition  of  the  fractional  Fourier  transform 

The  ath  order  fractional  Fourier  transform  of  a  function  /(•)  is  denoted  by  ^*[/](x)  and  may  be  defined  as: 

/°° 

'"[sin ^|~l7a~  e*pl‘>l*a  cot*  -  2xx  esc  4>  +  xa  cot^)]  }{x')  dx' ,  (1) 

where  4>  =  air/ 2  and  i  =  sgn(sin^).  Some  of  its  properties  are:  i.)  linearity;  ii.)  and  T*  correspond  to  the  identity 
operation;  iii.)  f*  corresponds  to  the  conventional  Fourier  transform;  iv.)  T*1  /*»  =  T”'  +°3 . 

One  of  the  most  important  properties  states  that  performing  the  ath  fractional  Fourier  transform  operation  corresponds 
to  rotating  the  Wigner  distribution  by  so  angle  4  —  >(r/3j  in  the  clockwise  direction.  We  are  unable  to  discuss  the  Wigner 
distribution  here,  although  it  is  important  to  fully  understand  the  filtering  examples  discussed  below.  The  reader  is 
encouraged  to  consult  [1]  and  the  references  given  there.  Roughly  speaking,  the  Wigner  distribution  of  a  function  /(-), 
denoted  by  Wf(x,  i/),  can  be  interpreted  as  a  function  that  indicates  the  distribution  of  the  signal  energy  over  space  x  and 
frequency  v.  Defining  the  rotation  operator  for  two-dimensional  functions,  corresponding  to  a  counterclockwise  rotation 
by  <t>,  the  property  mentioned  above  can  be  expressed  as  [/](*>  v)  =  R -4W,(x  ,  v).  Another  version  of  this  property  [6] 

is  Ki\Wj(x,  v)]  =  1^*[/]|2,  where  the  operator  R+  is  the  Radon  transform  evaluated  at  the  angle  <)>.  The  Radon  transform 
of  a  two-dimensional  function  is  its  projection  on  an  axis  making  angle  4>  with  the  x  axis. 

3  Filtering  examples 

Consider  the  signal  exp{-ir(x  -  4)J]  distorted  additively  by  exp{  - ixx3  )rect(x/ 1 6) .  The  magnitude  of  their  sum  is  displayed 
in  part  a.,  on  the  left  hand  side  of  the  figure.  These  signals  overlap  in  the  frequency  domain  as  well.  In  part  b.,  we  show 
their  a  =  0.5th  fractional  Fourier  transform.  We  observe  that  the  signals  are  separated  in  this  domain.  The  chirp  distortion 
is  transformed  into  a  peaked  function  which  does  not  exhibit  significant  overlap  with  the  signal  transform,  so  that  it  can 
be  blocked  out  by  a  simple  mask  (part  c.).  Inverse  transforming  to  the  original  domain,  we  obtain  the  desired  signal  nearly 
perfectly  cleansed  of  the  chirp  distortion  (part  d.). 

Now  we  consider  a  slightly  more  involved  example  in  which  the  distorting  signal  is  also  real.  The  signal  exp(-rxJ)  is 
distorted  additively  by  cos(2v(rJ/2  -  4x)]rect(x/8),  as  shown  in  part  a.,  on  the  right  hand  side  of  the  figure.  The  a  =  0.5th 
transform  is  shown  in  part  b.  One  of  the  complex  exponential  chirp  components  of  the  cosine  chirp  has  been  separated  in 


Figure  1:  Example  1  (left)  and  example  2  (right) 


this  domain  and  can  be  masked  away,  but  the  other  (till  distorts  the  transform  of  the  Gaussian.  After  masking  out  the 
separated  chirp  component  (not  shown),  we  take  the  a  =  -1st  transform  (which  is  just  an  inverse  Fourier  transform)  to 
arrive  at  the  a  —  —0.5th  domain  (part  c.).  Here  the  other  chirp  component  is  separated  and  can  be  blocked  out  by  another 
simple  mask.  Finally,  we  take  the  0.5th  transform  to  come  bade  to  our  home  domain  (part  d.),  where  we  have  recovered 
our  Gaussian  signal,  with  a  small  error. 

The  examples  above  have  been  limited  to  chirp  distortions  which  are  particularly  easy  to  separate  in  a  fractional  Fourier 
domain  (just  as  pure  harmonic  distortion  is  particularly  easy  to  separate  in  the  ordinary  Fourier  domain).  However,  it  is 
possible  to  filter  out  more  general  types  of  distortion  as  well.  In  some  cases  this  may  require  several  consecutive  filtering 
operations  in  several  fractional  domains  of  different  order  [1].  There  is  nothing  special  about  our  choice  of  Gaussian  signals 
other  than  the  fact  that  they  allow  easy  analytical  manipulation.  Also,  there  is  nothing  special  about  the  0.5th  domain.  It 
just  turns  out  that  this  is  the  domain  of  choice  for  the  examples  considered  above. 

In  the  above  examples  we  have  demonstrated  that  the  method  works,  but  did  not  discuss  what  led  ub  to  transform 
to  a  particular  domain  and  what  gave  us  the  confidence  that  doing  so  will  get  rid  of  the  distortion.  This  becomes  very 
transparent  once  one  understands  the  relationship  between  the  fractional  Fourier  transform  and  the  Wigner  distribution. 
This  relationship,  as  well  as  the  general  philosophy  behind  such  filtering  operations  is  discussed  in  [1]. 

4  Conclusion 

What  we  know  as  the  space  and  spatial  frequency  domains  are  merely  special  cases  of  fractional  domains.  These  domains 
are  indexed  by  the  parameter  a.  The  representation  of  a  signal  in  the  ath  domain  is  the  nth  fractional  Fourier  transform 
of  its  representation  in  the  a  =  0th  domain,  which  we  define  to  be  the  space  domain.  The  representation  in  the  a  =  1st 
domain  is  the  conventional  Fourier  transform.  If  we  set  up  a  twodimensional  space,  called  the  Wigner  space,  such  that  one 
axis  (x)  corresponds  to  the  a  —  0th  domain  (the  conventional  space  domain)  and  the  other  (v)  to  the  a  =  1st  domain  (the 
conventional  spatial  frequency  domain),  then  the  ath  domain  corresponds  to  an  axis  making  an  angle  <g  =  ax/2  with  the  x 
axis. 

A  desired  signal  and  noise  may  overlap  in  both  conventional  space  and  frequency  domains,  but  not  in  a  particular 
fractional  domain.  Even  when  this  is  not  the  caee,  spatial  filtering  in  a  few  fractional  domains  in  cascade  may  enable  the 
elimination  of  noise  quite  conveniently.  It  is  possible  to  implement  these  operations  optically. 

It  is  a  pleasure  to  acknowledge  the  coeitributians  of  A.  W.  Lohmaim  of  the  University  of  Erlangen-Numbergin  the  form 
of  many  discussions  and  suggestions. 
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ABSTRACT  State-of-the-art  review  of  all -optical  dymanic 
memories  and  their  crucial  elements  is  given.  It  discusses  the 
properties  of  all -optical  regenerators  of  short  optical  pulses. 
Characteristics  and  properties  of  an  optical  dynamic  memory 
operating  through  SRS  in  optical  fibers  are  described. 

SUMMARY  An  all -optical  dymanic  memory  CAODM3  is  designed  to 

store  information  in  the  form  of  optical  pulses.  AQDM  is  needed 

in  optical  digital  computers,  telecommunication  systems,  for 

signal  coding  and  decoding  devices  etc. 

To  be  competiti ve, A00M  should  operate  with  rather  short 
—11  —12 

pulses  CTp=10  -10  s!>  and  provide  almost  unlimited  storage 

time  Cr  ).  In  the  simplest  case  of  moderate  storage  times  Cr  < 

-5  m  m 

lO  s3  AODM  can  be  simply  an  optical  fiber  delay  line.  However, 

at  long  that  is  achieved  by  circulating  pulses  through  a 

fiber  ring,  AODM  must  include  an  amplifier  to  compensate  for 

optical  losses  and  a  nonlinear  threshold  element  to  restore  the 

shape  and  time  position  of  pulses,  i .  e.  ,  to  provide  the 

functional  stability  of  the  scheme.  Such  a  nonlinear  element 

that  has  to  suppress  a  weak  signal  and  form  a  required  standard 

level  of  a  high  output  signal  provides  a  necessary  signal /noise 

ratio,  the  ratio  between  logic  one  and  zero,  and  also  a  precise 

pulse  timing.  In  essence,  this  element  is  a  regenerator  of 

light  pulses. 

Since  a  large  bandwidth  is  required  in  AODM  CAf  * 
101^-101^Hz3i t  is  most  suitable  to  use  fiber  optic  logic  or 
switching  elements  of  the  type  of  nonlinear  Mach-Zendner  or 
Sagnac  interferometers,  and  elements  based  on  the  Kerr  effect, 
soliton  interaction  or  SRS  as  regenerators.  A  typical  drawback 
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of  fibre  elements.  i.e.  a  long  latency  time,  does  not  play  an 

important  role  in  AODM.  As  an  example  of  a  regenerator  with 

high  parameters  a  nonlinear  Sagnac  interferometer  (11  can  be 

mentioned.  Its  parameters  are:  clock  frequency  5  Gbit/s,  r  3  10 

P 

ps,  signal  peak  power  -  as  low  as  0.74  W,  interferometer  loop 
length  -  S  km,  and  switching  contrast  -  15  db.  Nevertheless  in 
spite  of  evident  progress  in  various  particular  elements 
constituting  AODM  there  are  little  published  data  on  the 
experimental  investigation  of  AODM  as  a  whole.  This  report  will 
consider  the  main  principles  of  operation  of  optical 
regenerators  and  will  consider  in  detail  the  experimental  model 
of  AODM  developed  at  A.  F.  Ioffe  Physical  Technical  Institute 
(21.  The  AODM  scheme  uses  a  logic  invertor  operating  through 
SRS  [ 33  as  a  regenerator  and  amplifier  simultaneously.  The 
parameters  of  the  scheme  are  as  follows  123.  Storage  time  is 
almost  unlimited,  memory  capacity  -  500  bit,  clock  frequency  - 
lOO  MHz.  pump  wavelength  -  1.00  pm,  average  energy  consumption 
-  2.5  W.  pulse  duration  -  70-100  ps,  ratio  between  logic  one 
and  zero  levels  -  50-100. 

Another  approach  of  great  interest  are  the  schemes  using 
solitons.  Here  the  pulse  shape  is  preserved  on  signal 
propagation  along  a  large  fiber  length.  Therefore  increase  of 
the  memory  capacity  and  storage  time  can  be  achieved  sometimes 
in  a  more  economical  way  141. 

To  summarize,  a  fiber  AODM  is  one  of  those  optical  devices 
that  are  of  great  scientific  interest  and  can  find  practrical 
use. 
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ABSTRACT 

1-D  hologram  technology  for  parallel  information  recording  and 
processing  and  scam  applications  are  considered.  Among  them  are  a  two-layer 
neural  net  performing  serial -parallel  processing,  and  an  associative  memory 
using  photothermoplastic  material.  Both  the  theoretical  and  experimental 
results  are  presented. 


SUNARY 

A  disk  memory  system  using  1-D  hologram  technology  and  seme 
applications  for  information  processing  are  discussed. 

One  of  applications  is  an  optoelectronic  neuroprocessor  using  1-D 
hologram  technology.  The  neuroprocessor  processes  information  in  a 
serial -parallel  way,  which  allows  implementation  of  a  high-capacity  neural 

network  with  up  to  10to  interconnections,  the  number  of  real  neurons  being 
small.  This  is  achieved  by  means  of  serial  accumulation  of  signals  during 
continuous  disk  rotation. 

The  artificial  neuron  used  comprises  a  pair  of  photodiodes  and  an 
electronic  scheme  with  thresholding.  Signals  are  accumulated  in  time  using 
capacitors.  That  means  the  summation  of  signals  weighted  with  weights  T  . 

Intemeuron  connections  are  incremented  by  recording  a  1-D 
microhologram  array  on  the  disk.  The  structure  of  intemeuron  connections 
can  be  changed  by  using  different  tracks  corresponding  to  the  different 
models  of  neural  networks.  The  disk  can  also  be  replaced,  if  necessary. 

The  holographic  neuroprocessor  having  200-mm  disk  with  more  than  1 
Gbyte  capacity  provides  the  transfer  rate  of  16  Mbit/s,  REN  being  equal  to 

150.  This  allows  implementation  of  neural  networks  having  10s  neurons  in 
each  layer  or  5000  different  neural  nets  having  1000  neurons  in  each 

layer.  In  this  case  the  processing  time  of  one  layer  is  equal  to  120  ms. 
Note  that  cne  has  an  opportunity  to  increase  the  processing  rate  by  1  or  2 
orders  of  magnitude  by  increasing  REM,  laser  power  and  the  sensitivity  of 
the  photodetector  array. 

Another  application  relates  to  a  holographic  associative  memory 
wherein  1-D  information  is  recorded  on  photothermoplastic  material. 
Information  is  recorded  in  the  bipolar  form  using  separation  by 
polarization.  Two  version  of  the  memory  have  been  investigated:  with 
angularly  superimposed  holograms  and  with  spatially  separated  ones. 

Linear  electrooptic  32-channel  light  modulators  were  used  to  form  both 
information  arrays  and  reference  sources  in  the  process  of  recording  and 
correlation  retrieval.  The  arrangement  of  modulators  and  linear 
photodetector  arrays  minimizes  the  overlapping  of  correlation  signals.  1-D 
information  recording  is  performed  sequentially,  each  information  array 
being  recorded  with  a  separate  reference  beam,  using  a  separate  channel  of 
a  reference  modulator. 

Vtien  the  correlation  analysis  is  performed,  a  key  array  is  fed  to  the 
signal  modulator  for  associative  search.  The  retrieved  correlation  signals 
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arc  detected  by  photodetectors  of  correlation  signals.  Electric  correlation 
signals  are  than  processed  to  determine  the  nuabar  of  the  array  which 
corresponds  aost  of  all  to  the  key  array  according  to  one  or  another 
criterion. 

lb  retrieve  information  of  the  selected  array,  the  signal  modulator  is 
turned  off  and  the  respective  channel  of  the  reference  aodulator  is  turned 
oh.  The  bean  restores  the  selected  information  array  on  the  signal 
photodetectors . 

In  the  experiment  ten  information  arrays  were  recorded  in  the 
holographic  associative  memory  with  spatially  separated  holograms.  Each 
area  of  the  photothermoplastic  was  charged  and  developed  separately.  The 
diffraction  efficiency  of  holograms  was  in  the  range  of  5-7%.  The 
associative  search  was  investigated  by  feeding  parts  of  the  recorded  arrays 
to  the  modulator.  The  probability  of  an  array  selection  error  versus  number 
of  bits  in  the  key  array  after  100  recording/erasing  cycles  was  determined. 
As  has  been  found,  the  scheme  with  spatial  separation  is  more  preferred  in 
case  photothermoplastic  holographic  media  are  used. 
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Abstract 

New  advances  on  invariant  pattern  recognition  using  associative  optical 
memories,  showing  simulations  and  first  experimental  results,  on 

1 )  orientation  detection  of  any  object  before  identification, 

2)  pattern  recognition  invariant  to  scale  and  projections. 


Summary 

To  increase  the  computing  efficiency  with  moderate  data  flow  (Video  rate  typical), 
the  original  idea  of  the  hybrid  neural  network  architecture  (modified  Hopfield 
type),  is  to  utilise  a  non-linear  optical  amplifier  between  two  cascaded  optical 
correlators,  the  first  one  giving  the  weights  (inner  product  with  central 
correlations),  the  second  yielding  the  weighted  memory  output  for  electronic 
thresholding  and  feed-back  to  the  optical  processor.  In  the  implementation 
proposed  at  OC'92  (1],  the  high  capacity  programmable  optical  memory  was  a 
photothermoplastic  plate,  and  the  non-linear  amplifier  was  a  BGO  crystal  with 
phase  conjugation.  It  was  shown  that  a  high  non-linearity  (to  the  order  4  at  least) 
maintains  the  dominant  influence  of  the  T(0)  peak  in  an  extended  window  required 
for  shift  invariance  [2]).  The  progresses  presented  here  do  not  concern  the  non¬ 
linear  optical  layer,  but  the  algorithms  of  the  process,  and  should  be  also  of 
interest  for  electronic  computing. 

The  main  improvement  for  the  convergence  is  an  adaptive  thresholding.  In  a 
standard  Hopfield  model,  the  threshold  plane  is  the  correlation  plane,  which 
cannot  be  accessed  here.  Thus  one  utilise  the  Parseval  Theorem,  saying  that  the 
correlation  peak  amplitude  equals  the  energy  of  the  signal.  Our  threshold  criterion 
is  then  the  sum  of  the  output  plane.  Simulations  show  that  the  best  convergence  is 
obtained  with  an  adaptive  threshold  coefficient,  to  increase  the  gain  when  the 
competition  is  low  between  the  output  modes. 

Previously,  the  memory  content  for  orientation  detection  had  to  be  selective  of  one 
class  of  object,  with  limited  tolerance  to  distortions,  and  worked  only  on  contours 
with  low  optical  efficiency.  Now,  excellent  results  have  been  obtained  for  any  type 
of  filled  objects,  using  as  memory  a  simple  line  with  rotated  replicas.  An  example 
is  shown  on  Figure  l.The  selectivity,  tested  with  ellipses  as  input,  reaches  a  1  dot 
difference  in  the  discrete  image. 

After  this  pre-processing,  a  full  invariance  in  pattern  recognition  has  been 
obtained  with  a  new  Log-Log  single  harmonic  filter  (2-D  MelT*'  transform). 
Invariant  to  projections  on  both  axes,  it  does  dot  require  a  Mellin  transform  on  the 
real-time  input  as  in  previous  implementations  [3].  Its  discrimination  capability  is 
illustrated  on  Figure  2. 

Simulations  for  the  associative  memory  with  this  filter  are  on  progress  and  will  be 
shown  at  the  conference,  so  as  the  optical  experiments  for  rotation  pre-processing. 
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Fig. 2  -  Correlation  with  a  Log-Log  single  harmonic 
filter  (  invariant  to  projections  on  both  axes) 

References  :  1)  GJLebreton  et  al.,' "Invariant  pattern  recognition  and  hybrid  neural 
networik",SPTE  Proceed.  Vol.1806, 1993,  pp.305-312;  2)  D.Psaltis,  J.Hong,  "Shift- 
invariant  optical  associative  memories",  Opt.  Eng.  26/1, 1967,  pp.10-15; 

3)  DJfcndlovic,  EJMarom,  N-Konforti,  "Scale  and  projection  invariant  pattern 
recognition",  AO  28/23,  Dec. 1969,  pp.4982-4966. 


TuB2/l67 


Optical  Implementations  of  a  Stochastic  Neural  System 

W.A.  Crossland,  Dept  of  Engineering,  (0223  330264) 

Tnunpington  Street,  Cambridge,  CB2  1PZ 
T.J.  Hall,  Dept  of  Electronic  and  Electrical  Engineering,  (071  873  2151) 

King’s  College,  University  of  London,  Strand,  London  WC2R  2LS 

J.S.  Shawe-Taylor  and  M.  van  Daalen,  Dept  of  Computer  Science,  (0784  443421) 
Royal  Holloway,  University  of  London,  Egham,  Surrey  TW20  OEX 

Abstract 

The  paper  addresses  the  design  of  an  optoelectronic  implementation  of  a  stochastic  bit- 
stream  nenral  system  which  operates  by  manipulating  digital  bit  streams  to  create  emergent 
activation  functions  using  extremely  simple  logic. 

This  paper  investigates  the  possible  strategies  for  implementing  a  stochastic  neural  network 
design  [6]  making  use  of  the  latest  opto-electronic  technologies  [3,  4].  The  network  design 
operates  by  manipulating  digital  bit  streams  to  create  an  emergent  activation  function  using 
extremely  simple  logic.  The  neural  design  and  implementation  strategy  will  potentially  yield  a 
number  of  impressive  benefits. 

•  The  optical  connections  have  a  very  high  bandwidth  which  can  overcome  the  I/O  bottle¬ 
neck  in  VLSI  implementations  of  the  stochastic  neural  design  [5]. 

•  The  stochastic  approach  introduces  real  values  through  a  precisely  controlled  probabilis¬ 
tic  technique,  which  makes  possible  a  complete  and  exact  mathematical  description  and 
simulation  of  the  network  functionality. 

•  In  contrast  to  analogue  implementations,  digital  networks  can  be  combined  without  intro¬ 
ducing  further  uncertainties  in  the  accuracy  of  the  computation.  Hence  implementations 
can  be  scaled  up  without  major  modifications. 

•  Both  the  speed  and  the  digital  nature  of  the  basic  operations  will  mean  that  effective  on- 
chip  learning  may  be  incorporated  into  the  design,  by  exploiting  the  stochastic  properties 
of  the  network  operation. 

A  standard  neural  design  involves  a  network  of  neurons  each  processing  its  inputs  through 
a  weighted  sum  and  calculating  a  sigmoid-like  function  on  the  result.  Hence,  one  of  the  funda¬ 
mental  problems  inherent  in  a  massively  parallel  implementation  of  a  neural  architecture  is  how 
to  multiply  together  the  real  inputs  and  their  corresponding  weights  without  resorting  to  cum¬ 
bersome  bit-parallel  digital  circuitry.  The  analog  solution  to  this  problem  incurs  a  number  of 
difficulties,  including  relatively  low  resolution,  cross-chip  variations  in  component  performance, 
and  the  resulting  problems  in  constructing  large-scale  reliable  systems. 

The  stochastic  bit-stream  approach  combines  the  benefits  of  digital  circuitry  and  analog 
simplicity.  In  this  approach  a  real  value  v  in  the  range  [-1,1]  is  represented  by  a  sequence 
of  bits  [2]  in  which  the  probability  of  each  bit  being  set  to  1  is  (v  +  l)/2.  The  bit  stream 
created  by  the  bit-wise  XOR  of  the  weight  and  input  streams  computes  a  stream  representing 
their  product.  One  bit  of  each  of  these  individual  weighted  input  streams  are  summed  and 
compared  with  a  threshold  value  to  determine  a  bit  of  the  output  stream.  The  interaction  of 
the  probability  distributions  of  the  sum  and  the  thresholds  creates  a  sigmoid-like  functionality, 
which  can  be  precisely  described  [5].  The  design  of  a  single  neuron  is,  therefore,  extremely  simple, 
making  it  possible  to  map  a  large  network  onto  an  established  implementation  technology.  The 
functionality  of  the  neuron  will,  however,  be  demonstrated  with  simulations  of  the  application 
of  the  Mean  Field  Annealing  algorithm  to  the  graph  bisection  problem.  The  quality  of  solution 
and  number  of  iterations  is  comparable  with  standard  neurons,  though  if  the  neurons  were 
implemented  in  massively  parallel  technology  large  graphs  could  be  processed  in  real  time  using 
the  bit  stream  technique. 
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In  thi>  paper  we  will  pretext  two  opto- electronic  implementations  of  the  neural  architecture. 
They  are  illustrated  in  figure  1.  One  design  is  fully  spatially  multiplexed  (maximally  parallel), 
whilst  the  other  introduces  the  extra  dimemsion  of  time  multiplexing.  A  neuron  in  a  time 
multiplexed  system  receives  the  first  hit  from  each  connection  in  turn  and  only  delivers  an 
output  bit  when  all  have  been  processed.  This  considerably  simplifies  the  design  of  the  neurons, 
but  at  the  expense  of  their  operational  speed.  It  also  simplifies  the  beamlet  optics  for  a  given 
number  of  neurons  when  compared  to  the  fully  spatially  multiplexed  design. 


Figure  1:  The  spatial/time  multiplexing  scale  of  implementations 
Both  of  the  optical  implementations  require  image  replication  and  beam  handling  optics. 
Image  replication  has  been  demonstrated  [4]  using  computer  generated  holograms  in  conjunction 
with  bulk  lense  systems  to  produce  output  arrays  consisting  of  up  to  1000  x  1000  beamlets.  Beam 
detection  is  achieved  by  the  use  of  silicon/GaAs  photodetectors,  and  modulation  is  possible 
through  the  use  of  various  opto-electronic  devices.  Modulation  is  most  easily  achieved  with  liquid 
crystal  spatial  light  modulators,  but  only  at  relatively  slow  speeds  in  the  region  of  10  MHz  may 
be  realised.  We  will  therefore  incorporated  faster  devices  such  as  LED  based  photothyristors  [1] 
featuring  switch  off  times  less  than  10ns,  or  arrrays  of  individually  addressable  surface  emitting 
semiconductor  lasers  [3]  operating  at  speeds  up  to  1  GHz.  The  choice  of  detectors  and  modulators 
will  depend  of  the  overall  system  architecture.  For  example,  the  time  multiplexed  designs  will 
contain  high  speed  subsystems,  that  will  require  the  use  of  the  faster  opto-electronic  devices. 
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Abstract 

An  attempt  is  made  to  exploit  the  inherently  rich  physics  of  volume  holographic  interconnects.  The  effects  of 
crosstalk  and  nonlinearities  are  considered.  When  combined  with  a  novel  feedback  configuration,  complex, 
seif-organising  behaviour  is  seen  to  emerge. 


Summary 

Volume  holographic  interconnects  represent  one  way  of  applying  optics  in  connectionist  approaches  to  signal 
processing.  They  combine  the  usual  advantages  of  optical  interconnects,  together  with  high  storage  capacities 
(>109  weighted  connections  cnr3)  and  offer  the  possibility  of  weight  update  during  training  [Pack  et  al]. 

However,  when  used  in  the  'conventional'  way,  several  factors  constrain  their  performance. 
These  include  effects  such  as  diffracted  amplitudes  being  a  nonlinear  function  of  grating  strength;  cross  gratings 
and  multiple  grating  interactions  giving  crosstalk;  the  recording  of  modifying  of  gratings  invariably  changing 
the  strengths  of  other  gratings  in  the  hologram  and  nonlinearities  introduced  by  the  recording  process  itself.  All 
the  foregoing  limit  the  use  of  volume  interconnects,  even  when  the  training  process  takes  account  of  these 
factors  [slinger].  An  alternative  approach  is  to  attempt  to  use  these  phenomena  to  good  advantage  -  in  effect,  to 
exploit  more  fully  the  rich  physics  of  this  dynamic,  multiple  grating  system.  This  may  enable  it  to  perform 
useful  information  processing  tasks  in  its  own  right  To  investigate  this  idea,  it  is  necessary  to  study  the  regimes 
of  behaviour  of  the  interconnect  in  various  configurations. 

One  such  novel  system  is  shown  schematically  in  figure  1.  A  single  mode  fibre  has  been 
added  to  the  conventional  volume  interconnect  arrangement  This  fibre  feeds  light  from  one  output  back  into 


Figure  1. 

Schematic  of  the  fibre 
feedback  modification  to  the 
usual  volume  interconnect 
arrangement  Note  the 
dynamic  holographic 
material  (e.g.  a  liquid  crystal 
ceO  or  a  photorefractive) 
and  the  absence  of  a  training 
array. 


The  governing  equations  can  be  written  as  : 


input  1,  via  a  beamsplitter  of  reflectivity  R. 


LENS  LENS 


{"  S  +  iH+  £ ;  C-  =  0 

mm 

-AGm  +  Bam^ 


where  n  is  the  bulk  refractive  index  of  the  hologram  material,  c  is  the  velocity  of  light  6m  is  the  angle  of 
propagation  of  wave  m,  amplitude  a*.  is  the  tm  th  grating  strength,  y,  A  and  B  are  material  constants. 
Equations  (1)  are  coupled  wave  equations  describing  the  diffraction  of  the  waves  as  they  propagate  through  the 
hologram,  whilst  equations  (2)  describe  the  grating  formation  in  the  hologram  -  for  the  case  shown,  a 
photorefractive  type  response  is  assumed.  The  boundary  conditions  can  be  written  as  : 


aj^x  =  0 ,  r)  =  form*! 

°lx =0,t)  m  0,0(1-*)  +  R  (1  -L)  a^x-d,t)  exp(j$) 


(3) 
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The  feedback  from  the  fibre,  combined  with  the  time  dependant  nature  of  the  gratings  in  the  hologram,  endows 
the  system  with  very  rich  and  varied  behaviour.  In  particular,  for  certain  system  parameters,  the  output  of  the 
interconnect  can  exhibit  multiple  fixed  point,  limit  cycle  and  strange  attractor  behaviour.  These  arise  as  the 
system  essentially  performs  a  N  dimensional  complex  mapping : 

-  A«w,'  w,)  (4) 

where  a(x)t  =  [a,(x,t).  a^x,t), . a/Jx,t)\T  is  the  vector  representing  the  diffracted  wave  amplitudes,  Q(x)t  is  the 

matrix  of  the  grating  strengths  (m,n=  1,2,3,.jV),  at  hologram  depth  x  and  time  t.  afO),  can  be  regarded  as  an 
input  pattern  presented  to  the  system.  This  pattern  input,  together  with  the  material  and  other  system 
parameters,  determines  the  time  evolution  of  the  system  output 

Figure  2,  for  example,  shows  a  bifurcation  diagram  resulting  from  varying  the  loss  factor  L  in 
the  fibre  feedback  loop.  The  figure  indicates  the  variety  of  complex  behaviour  possible  with  this  system,  by 
changing  a  single,  easily  modifiable,  parameter. 


Such  complex  modes  of  behaviour  are  being  increasingly  viewed  as  advantageous  in  pattern  processing.  For 

example,  the  ability  of  a  system  to  change  its  dynamics  during  a  search  task  has  been  demonstrated  to  give 

superior  classification  performance  [Nara  et  al].  Additionally,  complex  dynamics  in  a  classifier  can  also  give  it 

advantages  in  noisy  environments.  The  volume  interconnect  system  described  here  also  has  direct  equivalence  ( 

to  many  features  of  the  coupled  map  lattice  paradigm  [Kaneko].  Looked  at  in  a  slightly  different  way,  the 

iterative  processing,  generated  by  the  feedback,  endow  the  system  with  properties  not  obtained  with  layered 

networks  of  comparable  size. 

Self  organisation  in  pattern  processing  can  be  useful  as  a  preprocessing  stage  of  a  pattern 
recognition  system  (e.g.  as  in  Kohonen  s  self  organising  feature  maps).  The  interconnect  described  here  also  has 
its  own  seif  organising  features.  In  particular,  it  is  able  to  associate  input  patterns  with  different  attractors  at  its  \ 

output  Inputs  that  differ  slightly  can  be  classified  to  give  rise  to  the  same  attractor.  In  this  way,  the  system 
performs  classification  and  clustering  without  external  supervision. 

In  conclusion,  a  system  has  been  formulated  that  exploits,  more  fully,  the  complex  physics 
available  in  the  volume  interconnect  This  includes  the  use  of  many  processes  conventionally  regarded  as 

detrimental  to  interconnect  behaviour.  Unusual,  and  possibly  beneficial,  modes  of  behaviour  result  1 
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Abstract 

A  family  of  lateral  inhibition  architectures  which  use  the  self-linearised  SEED  effect  to 
implement  optical  subtraction  are  described,  and  the  operation  demonstrated  in  simulation. 

Introduction. 

Lateral  inhibition  networks,  where  nodes  within  a  processing  layer  inhibit  one  another, 
form  a  very  important  class  of  networks.  They  are  not  amenable  to  processing  on  conventional  serial 
machines,  since  they  require  some  type  of  iterative  self-consistent  solution.  But  it  is  this  property 
that  makes  such  networks  suited  to  parallel  implementation,  and  the  dense,  recurrent  nature  of  the 
interconnection  suggests  an  optical  approach. 

The  essence  of  any  inhibition  network  is  that  the  activity  level  of  an  individual  node  must 
decrease  in  response  to  increasing  input  from  neighbouring  nodes  i.e.  subtraction,  which  is  difficult 
to  do  optically.  One  approach  has  been  to  use  an  optically  coupled  pnprt  light  emitter  and  an  npn 
phototransistor.fi]  An  alternative  possibility  is  to  use  the  Self  Linearised  SEED  effect,  observed  in 
a  serial  photodiode-SEED  modulator  combination. [2,3]  Feedback  provided  by  the  common  current 
causes  the  modulator  reflectivity  to  decrease  as  the  light  falling  on  the  photodiode  increases.  The 
small  range  of  modulation  can  be  improved  by  incorporating  the  SEED  device  in  a  resonant  optical 
cavity.[4]  This  combination  is  made  more  attractive  when  we  consider  that  the  SEED  is,  itself,  a 
pin  diode,  allowing  the  possibility  that  the  detector  and  modulator  can  be  made  together  in  the 
same  integrated  process.  However,  certain  difficulties  are  experienced  using  the  quantum  well 
diode  as  a  photodiode,  as  it  is  not  voltage  independent.  Miller  has  suggested  3  ways  of 
circumventing  this  problem.[3] 

Having  established  that  we  have  a  good  optical  subtraction  technology,  we  will  go  on  to 
investigate  architectures  in  which  this  effect  can  be  utilised. 

Simple  lateral  inhibition. 

The  simplest  lateral  inhibition  optical  system  we  can  conceive  of  is  shown  in  figure  1  in  a 
one-dimensional  format,  although  this  can  be  extended  to  a  two  dimensional  computing  surface. 
Each  node  consists  of  a  detector-modulator  pair,  and  receives  an  external  input  which  is  incident  on 
the  modulator.  A  portion  of  the  reflected  signal  is  redirected  to  the  detectors  of  neighbouring  nodes 
in  some  manner,  which  defines  the  lateral  interconnection  function  g(x).  The  node  does  not  inhibit 
itself.  The  optical  element  is  not  specified  and  may  be  reflective,  refractive,  diffractive,  or 
holographic.  Thus,  the  reflectivity  of  a  given  modulator  is  dependent  on  the  total  inhibitory 
signal  received  from  neighbours,  and  decreases  linearly  in  response  to  it.  A  point  to  note  is  that  the 
internal  state  of  the  node,  which  we  identify  with  the  modulator  reflectivity,  is  determined 
purely  by  the  neighbours,  and  is  not  dependent  on  the  input  to  that  node,  although  the  final  output 
is.  This  single  pass  geometry  effectively  results  in  a  convolution  of  the  input  with  the  lateral 
interconnection  function.  Figure  2  shows  the  results  of  a  typical  simulation  with  flat  lateral 
inhibition  ±3  nodes.  A  periodic  boundary  condition  is  imposed  to  avoid  edge  effects.  This  network 
performs  a  simple  edge-enhancement  function.  The  response  of  the  system  reflects  the  essentially 
linear  nature  of  the  device.  While  still  linear,  the  state  of  the  node  can  be  made  positively 
dependent  on  the  input,  as  well  as  inhibited  by  neighbouring  nodes,  by  using  a  differential  pair  of 
detectors,  following  the  example  of  Miller.(3J 

Lateral  inhibition  with  feedback 

By  introducing  a  nonlinear,  recurrent  feedback  of  the  output  to  the  input  winner-take-all 
(W-T-A)  behaviour  can  arise.  This  can  be  done  by  using  a  second  modulator  array  for  input,  as 
shown  in  figure  '.  The  modulator  array  is  uniformly  illuminated  and  the  external  inputs  presented. 
The  modulated  beams  are  then  directed  to  the  laterally  connected  layer  as  before.  However,  if  the 
output  is  monitored  and  used  to  control  the  input  modulator  array  a  recurrent  loop  is  established. 
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All  that  is  required  is  that  the  input  is  presented  for  a  sufficient  time  for  the  feedback  loop  to  be 
established,  thereafter  the  input  is  removed  and  the  system  converges  to  a  stable  state.  Again,  this 
is  a  one-dimensional  example  of  a  possibly  two-dimensional  surface. 

Cheng  &  Wan  have  shown  that  there  must  be  gain,  however  small,  in  the  feedback  loop;  if 
not,  all  signals  will  decay  to  zero.(5]  More  importantly,  from  an  optical  point  of  view,  they  have 
examined  the  situation  of  limited  lateral  inhibition.  Global  interconnection,  leading  to  a  single 
winning  node,  is  not  feasible  in  an  optical  system,  as  finite  optical  power  can  only  be  distributed  to 
a  limited  neighbourhood.  They  have  shown  that  in  a  finite  interconnection  network  a  winner  will 
arise  in  a  neighbourhood  defined  by  the  range  of  interconnection.[51  Thus,  there  are  local  winners 
where  the  width  of  each  competing  cell  is  roughly  determined  by  the  lateral  interconnection 
distance.  A  point  not  stressed  by  Cheng  &  Wan  is  that  each  cell  must  have  a  winner,  even  if  the 
input  in  that  area  is  perfectly  uniform.  Small  spurious  variations  arise  during  the  iteration  process, 
and  these  become  amplified,  eventually  giving  a  "winner”.  Figure  4  shows  an  example  of  an  input 
and  the  final  result  that  the  simulation  converged  upon.  As  can  be  seen,  local  maxima  are  located, 
but  a  spurious  peak  also  occurs  (around  node  22).  Such  a  system  could  form  the  heart  of  a  self- 
organising  network,  when  prefixed  by  a  Hebbian  net  work. (61 

A  further  nonlinearity  is  introduced  in  shunting  networks  where  the  lateral  inhibition 
signal  is  modulated  (multiplied)  by  the  value  of  the  node.  (71  In  the  context  of  this  work  an  optical 
implementation  can  be  envisaged  whereby  the  inhibition  optical  signal  is  reflected  from  the  node 
modulator  before  detection.  This  would  necessitate  a  more  complex  optical  arrangement.  It  should 
be  noted  that  this  would  not  correspond  directly  to  the  conventional  shunting  network  as,  in  this 
case,  the  inhibition  signal  is  being  multiplied  by  the  modulator  reflectivity  rather  than  the  single 
state  variable  generally  used. 
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Abstract:  Based  on  fractional  Fourier  transforms  a  new  optical  architecture  is  developed  to 
make  compromise  between  shift-invariant  (frequency)  and  position-dependent  filterings,  and  its 
analogy  to  neural  networks  and  corresponding  learning  algorithm  are  presented. 

1.  Introduction 

Recently  Fourier  transform  of  fractional  order  and  their  optical  implementation  had  been 
developed.  [1,2]  Unlike  the  ordinary  Fourier  transform,  Le.  the  fractional  Fourier  transform 
with  order  1,  the  fractional  Fourier  transforms  extract  features  which  combine  both  spatial  and 
spatial-frequency  charactersitics  of  the  original  images.  One  may  easily  expect  that  the 
fractional  Fourier  transformed  image  is  snmatfiity  between  the  original  image  and  full  Fourier 
transformed  image  The  fractional  Fourier  transform  may  be  optically  irrytemantaH  by 
quadratic  graded  index  (GRIN)  media,  which  provide  rotation  in  Wigner  [2]  or  Quasiparticle 
[3]  distribution  function.  Much  simpler  optical  architectures  with  only  single  lens  or  double 
lenses  with  proper  spacing  were  also  reported.  [21  In  this  paper  we  present  a  new  optical 
architecture  for  shift-invariant  (spatial  frequency)  and  position-dependent  filterings  based  on  2 
fractional  Fourier  transforms,  and  devebpe  analogy  between  this  architecture  and  neural 
networks.  Adaptive  learning  algorithm  to  design  the  filters  or  neural  networks  is  presented 

2.  Optical  Spatial  and  Spatial-Frequency  Filter  based  on  Fractional  Fourier  Transforms 

The  fractional  Fourier  transform  of  order  p=l/N  performs  ordinary  Fourier  transform 
when  applied  N  times  sequentially.  Its  shifting  rule  is  worth  noticing.  Assuming  die  frac¬ 
tional  Fourier  transform  of  order  p  for  the  idx)  is  jF*(u(x))=£/(fcA  I  Fp(u(x-b))\  becomes 

I U(k  -  b  cos  <M  where  $=p  n/2.  [1]  For  ordinary  Fourier  transform,  La  p=l,  I  Fl(idx-b))\ 
is  \U(k)\  to  result  in  shift  invariance.  For  fractional  Fourier  transforms  this  shift  invariance 
is  somewhat  destroyed  and  position-dependent  information  is  maintained.  When  input  image 
is  filtered  at  Fourier  plane  in  the  4-f  matched  filter,  only  spatial-frequency  filtering  is 
performed  and  results  in  convolution  integral  This  space- invariant  input-to-output  mapping 
is  useful  for  target  localization  and  tracking,  but  not  desirable  for  many  classification  or 
associative  memory  of  many  images.  Slight  local  shift  invariance  is  usually  enough  and 
preferred 

We  substitute  the  ordinary  Fourier  transforms  in  the  4-f  matched  filter  into  fractional 
Fourier  transforms  with  orders  pi  and  pg.  Following  Ref.  [2],  we  adopt  simple  one  thin-lens 
architecture  and  explicit  integral  formula  for  a  fractional  Fourier  transform.  Within  the 
approximations  input-output  mapping  relationship  of  tire  pi-ps  filter  is  now  newly  derived,  and 
the  result  for  the  most  interesting  case  (pi+P2=2)  becomes 

v(x)  =  |  U(2)  M-gfp)  expt^-^-1  ofc  ,  (1) 

where  u( 2)  and  v(x)  are  input  and  output,  respectively,  and  =  pix/2  and  #2  =  pjx/2. 
The  h(.)  is  the  ordinary  Fourier  transform  of  the  filter  H(k)  between  the  two  fractional 
Fourier  transforms.  Compared  with  the  4-f  «*»**«*»  filter,  Eq.(l)  now  has  scaling  factor 
(  sin+i)  in  the  filter  function.  Also,  additional  exponential  modulation,  which  emphasizes  spatial 
points  near  x=2,  is  introduced  This  exponential  term  destroys  the  shift-invariance,  and 
provides  position-dependant  classification  wife  slight  local  shift-invariance.  The  relative 
importance  of  the  shift-invariant  (frequency)  filtering  and  position-dependent  classification  is 
controlled  by  the  #1  *  pix/2.  Character,  image,  and  speech  recognitions  might  be  useful 
applications  of  this  optical  filters. 
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Fig  J.  Neural  network  analogy  at  the  spedal/frequeocy  FSg-2  Optical  architecture  at  the  neural  networks 
fitters  based  oo  fractional  Fourier  transforms.  based  on  fractional  Fourier  transforms 

3.  Neural  Networks  based  on  Fractional  Fourier  Transforms 

In  Fig.l  die  neural  network  analogy  of  the  fractional  Fourier  transform  and  the 
spatial/frequency  filters  are  shown,  where  the  un  and  vi  are  input  and  output,  respectively, 
and  Um  and  Vm  are  corresponding  fractional  Fourier  and  inverse  transforms.  The  fractional 
Fourier  transform  operations  are  now  substituted  by  2  synaptic  weights,  WiH  and  wli\  and 
linear  summation  at  Um  and  vt.  For  classification  problems  additional  Sigmoid  function  S(.) 
may  be  followed.  The  ffm's  are  filter  transmissions  between  the  two  fractional  Fourier 
transforms  and  adaptively  trainable.  This  architecture  with  fixed  global  synapses  and 
adaptive  local  control  gains  is  similar  to  the  TAG  (Training  by  Adaptive  Gain)  model  [4],  an! 
the  popular  error  back-propagation  learning  rule  is  still  applicable.  With  proper  definition  of 
total  output  error  one  can  update  the  gm  propotional  to  -ReV  Umtm],  where  the  rm  is 
back -propagated  error  from  5*.  Optical  architecture  for  this  neural  networks  with  error 
back-propagation  learning  is  shown  in  Fig  .2. 

4.  Conclusion 

In  this  paper  we  present  a  new  optical  architecture  for  spatial/traquency  filtering  and 
adaptive  neural  networks  based  an  fractional  Fourier  transforms.  The  developed  learning 
algorithm  for  the  neural  networks  is  also  applicable  to  design  of  the  filters.  The  optical 
architecture  is  similar  to  4-f  matched  filters,  and  requires  only  2  lenses  for  filters  and  3 
lenses  for  single-layer  adaptive  neural  networks  with  back-propagation  learning  rule. 
Extension  to  multi-layer  neural  networks  is  straight-forward,  and  the  error  back-propagation 
algorithm  and  optical  architecture  are  still  valid.  With  this  single  architecture  «nd  devices 
large-scale  optical  implementation  at  adaptive  neural  networks  becomes  feasible. 
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Abstract 

We  present  results  obtained  with  an  all-optical  self-organizing  map  neural  network  applied 
to  digit  recognition.  The  experimental  system  is  built  around  two  Ferroelectric  Liquid  Crystal 
Bistable  Optically  Addressed  Spatial  Light  Modulators  in  a  resonator  configuration. 

Summary 

The  neural  network  we  implement  has  a  conventional  structure  with  an  input  layer  of  neurons 
connected  via  the  synaptic  weights  to  a  second  layer  or  “map”  of  neurons.  The  system  is  based 
on  Kohonen’s  self-organizing  map  [1]  where  the  spatial  neighbourhood  of  the  active  neurons  is 
taken  into  account  during  learning  process  to  produce  a  topological  organization  of  the  activity: 
similar  inputs  produce  similar  map-layer  activities.  The  implementation  is  all-optical  in  that 
electronics  (in  the  form  of  a  personnal  computer)  are  used  only  for  system  synchronization  and 
modification  of  learning  parameters,  all  calculations,  thresholding,  weight  updating  etc  are  per¬ 
formed  in  parallel  in  optics. 

The  key  devices  in  the  optical  setup  are  two  Ferroelectric  Liquid  Crystal  Bistable  Optically 
Addressed  Spatial  Light  Modulators  (FLC-BOASLMs).  If  such  a  valve  is  illuminated  with  a 
given  pattern  and  an  electrical  pulse  is  applied  to  the  electrodes,  the  pattern  can  be  binarized 
and  stored  on  the  valve  for  subsequent  rereading.  One  valve  is  used  in  this  way  to  perform 
the  thresholding  of  the  neural  activity.  Although  the  BOASLMs  are  intrinsically  bistable,  by 
operating  them  near  the  threshold  voltage  and  using  spatial  integration  techniques  they  can  be 
made  to  show  a  grey-level  behaviour  [2].  A  second  valve  is  used  in  this  manner  to  store  and 
update  the  synaptic  weights. 


(a)  Creation  of  the  neural  activity.  (b)  Weight  updating. 
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The  basic  operation  of  the  syster  s  shown  above.  The  input  pattern,  Xt},  is  presented  to 
the  system  with  an  electrically  addressed  SLM.  This  pattern  is  imaged  onto  the  array  of  weight 
maps.  WtJ,  (first  BOASLM)  thus  activating  the  corresponding  weight  maps.  An  optical  crossbar, 
consisting  of  a  lenslet  array  and  a  collimating  lens,  then  images  the  activated  weight  maps  onto 
the  second  BOASLM  thus  realizing  the  required  sum  J2XtJW,r  The  resulting  neural  activity 
is  binarized  and  stored  on  the  second  BOASLM.  The  neighbourhood  function  is  introduced  by 
varying  the  height  and  duration  of  the  voltage  pulse  sent  to  the  BOASLM  to  dilate  or  erode  the 
neural  activity  pattern. 

This  activity  is  read  with  a  plane  wave,  duplicated  with  a  Dammann  grating  and  fed  back, 
through  the  input  image,  to  the  weight  map  BOASLM  where  it  is  added  to  the  active  weight 
maps  and  subtracted  from  the  inactive  ones.  The  effect  is  to  reinforce  the  connections  between 
active  input  neurons  and  active  zones  in  the  neural  map,  while  at  the  same  time  weakening 
non-productive  connections.  In  this  way  a  Hebbian-type  learning  rule  is  implemented. 

Computer  simulations  which  take  the  limits  of  the  optical  system  into  account,  have  con¬ 
firmed  the  feasibilty  of  the  approach;  the  network  correctly  learning  and  classifying  a  set  of  test 
input  images  as  well  as  demonstrating  generalization  capabilities  [3].  The  experimental  system 
has  already  been  shown  to  correctly  recognize  the  different  input  classes  when  initialized  with 
a  weight  map  array  obtained  from  the  simulations.  Our  present  work  is  concentrated  on  the 
demonstration  of  optical  learning,  the  most  recent  results  in  this  direction  will  be  presented  at 
the  conference. 
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Abstract:  We  present  experimental  data  of  a  novel  differential  PnpN  optical  switch,  showing  a 
cycle  time  smaller  than  60  ns  and  an  optical  switching  energy  of  0.2  fJ/pm1 2. 


III-V  optical  PnpN  devices  (also  variously  called  DOES,  VSTEP,...)  are  promising  optoelectronic 
switches.  Very  good  optical  sensitivity  (400  fJ  or  0.07  fl/pm2)  has  been  reported  for  a  differential 
pair  of  optical  PnpN  devices  [I].  However,  the  repetition  speed  of  the  operation  cycle  is  very  slow 
due  to  the  long  switch-off  time  (of  the  order  of  0.1  to  10  ms).  The  switch-off  is  governed  by  the 
slow  decay  of  excess  free  carriers  stored  in  the  center  p  and  n  layers  of  the  element  of  the  pair 
which  was  in  its  on-state  in  the  previous  cycle.  For  correct  operation  of  the  pair,  the  light¬ 
generated  carriers  must  exceed  the  excess  left-over  carriers  of  the  previous  operation  cycle. 
Therefore,  for  operation  above  kilohertz,  the  optical  input  energy  is  large.  In  other  words,  the 
trade-off  between  cycle  speed  and  optical  sensitivity  is  poor. 

We  solved  this  trade-off  problem  by  making  monolithic  differential  pairs  using  the  PnpN  devices 
described  in  the  abstract  of  Heremans  et  al.  [2].  The  devices  are  specially  designed  such  that  the 
center  p  and  n  layers  can  be  completely  depleted  of  free  carriers  in  nanoseconds  by  application  of 
a  negative  anode-to-cathode  voltage  pulse  (“carrier-extraction  phase”)  prior  to  giving  the  optical 
input.  The  area  of  the  thyristors  of  a  pair  is  30  x  40  pm2. 

An  operation  cycle  consists  of  3  phases.  First,  a  carrier-extraction  pulse  (20  ns),  to  reset  the 
thyristors  of  the  pair;  then,  a  light  input  phase  (the  shortest  time  for  this  phase  was  3  ns);  third,  a 
switch-on  phase,  during  which  the  thyristor  which  has  received  the  optical  input  switches  on  and 
emits  light  (20  ns).  The  total  cycle  time  is  thus  smaller  than  60  ns  (limited  by  our  apparatus). 
Such  cycles  can  be  pasted  without  dead  time  in  between.  Therefore,  the  operation  cycle 
frequency  of  our  switch  is  at  least  1 5  MHz.  Figure  1  shows  the  measured  relationship  between  the 
duration  of  the  optical  input  pulse  and  the  optical  input  power  necessary  for  correct  switching  in  a 
sequence  LRRLLRRL  (L  meaning  switch-on  of  the  left  element,  and  R  of  the  right  element  in  the 
pair).  The  figure  shows  that  0.25  pJ  is  sufficient  for  correct  switching.  This  corresponds  to  0.2 
fJ/jim2.  Figure  2  demonstrates  the  necessity  to  use  the  carrier-extractipn  pulse:  the  optical  input 
energy  increases  with  2  orders  of  magnitude  when  the  amplitude  of  the  carrier-extraction  pulse  is 
not  sufficient  for  completely  depleting  the  PnpN  devices  of  the  pair  during  the  first  phase  of  the 
operation  cycles.  It  should  be  noted  here  that  all  PnpN-based  optoelectronic  switches  published 
so  far  have  a  layer  structure  which  does  not  permit  complete  carrier  extraction  by  an  extraction 
pulse. 

Figure  3  gives  a  general  overview  of  the  performance  of  acknowledged  optoelectronic  switches, 
including  S-SEED,  FET-SEED,  HPT+LED,  VSTEP,  PNPN.  Two  figures-of-merit  are  plotted:  the 
required  optical  switching  energy  per  unit  area  and  the  achievable  cycle  frequency.  Our  switch 
shows  the  largest  sensitivity-bandwidth  product,  despite  the  fact  that  we  have  performed  no 
optimisation  as  to  its  speed  (10-fold  improvement  should  be  achievable).  Additional  advantages 
are  the  non-critical  operation  voltages  and  input  wavelength,  simple  and  short  epitaxial  growth  and 
relaxed  lithographic  constraints. 

[1]  K.  Hara,  K.  Kojima,  K.  Mitsunaga  and  K.  Kyuma,  IEEE  J.  Quant.  Electr.,  vol.  28,  no5,  1992. 

[2]  P.  Heremans,  M.  Kuijk,  R.  Vounckx  and  G.  Borghs,  “The  completely  depleted  PnpN  optoelectronic 
switch”,  abstract  sent  in  to  Optical  Computing  *94,  Edinburgh,  August  1994. 
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Optical  pulse  duration  (ns) 

EisiuxJ  :  Measured  external  optical 
switching  energy  of  0.25  pi/ 1200  pm2 
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Figure  2.  The  necessity  of  the  carrier-extraction 
pulse:  insufficient  carrier  extraction  results  in  2 
orders  of  magnitude  larger  optical  input  energy 


1  S-SEED  (AT&T) 

2  FET-SEED  (AT&T) 

3  N1PI -modulator  (Erlangen) 

l 

4  laser  VC-VSTEP  (NEC) 

5  HPT+LED  (Matsushita) 

6  PNP(N)+VCSEL  (New  Mexico) 

7  Differential  PNPN  (Mitsubishi) 

8  This  work  (IMEC-VUB) 


Figure  3:  Optical  switching  energy  per  unit  detector  area  versus  cycle  frequency  of  optoelectronic 
switches.  The  reports  not  mentioning  a  cycle  frequency  have  been  grouped  under  “single  shot  operation” 
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Abstract 

The  nonlinear  optical  properties  of  hybrid  AlAs/GaAs  Bragg  reflectors  are 
presented.  Experimentally,  optical  bistability  at  optical  intensities  as  low  as 
35  mWcm-2  is  obtained  when  a  bias  voltage  of  115  V  is  applied. 

Summary 

In  1992  Cada  et  al.  [1]  observed  optical  nonlinearity  in  a  periodically  layered  III- V 
semiconductor  structure  consisting  of  pairs  of  optically  linear  ( AlAs)  and  nonlinear  (GaAs) 
layers  and  optical  bistability  was  found  at  optical  intensities  Iin  of  about  10  KWcm-2.  In 
the  same  year,  optical  bistability  was  achieved  by  the  authors  in  a  hybrid  multilayered 
AlGaAs  structure  at  optical  intensities  of  merely  1.4  Wcm-2  at  a  wavelength  of  885  nm 
using  a  bias  voltage  applied  perpendicular  to  the  layers  [2].  And  a  physical  model  for  this 
hybrid  case  has  been  developed  recently  using  a  phenomenological  approach  [3].  Moreover, 
Ivanov  and  Haug  [4]  formulated  analytical  equations  for  this  hybrid  case  leading  to  optical 
intensity  levels  of  less  than  1  Wcm-2.  Additionally,  they  predicted  switching  times  in  the 
order  of  100  ns  giving  characteristic  energy  densities  smaller  than  100  nJcm-2. 


Fig.  1:  Sketch  of  the  hybrid  multilayered 
structure. 


In  this  paper,  a  hybrid  Bragg  reflector  is  investigated  consisting  of  20  pairs  of  GaAs 
and  AlAs  layers  with  nominal  thicknesses  of  58  nm  and  69  nm,  respectively  (Fig.  1).  In  a 
first  experiment,  the  reflection  is  measured  as  a  function  of  the  incident  optical  intensity  Iin 
at  a  wavelength  X  of  879.5  nm  and  using  an  impressed  voltage  V0,  see  Fig.  2.  Obviously, 
starting  at  low  intensities  the  reflection  first  decreases  with  increasing  I;n.  Further,  at 


TuC2/l80 


Fig.  2:  Optical  bistability  measured  at  Fig.  3:  Intensity  for  the  switching  between 
room  temperature.  lower  and  higher  reflection  state  versus 

reciprocal  externally  applied  voltage. 


a  threshold  intensity  Ith,  in  this  case  180  mWcm-2,  the  reflection  switches  to  a  higher 
state.  As  can  be  depicted  from  Fig.  2,  the  reflection  contrast  ratio  is  2.0  dB.  Lowering 
the  optical  intensity  again,  the  reflection  remains  almost  constant  at  the  higher  level.  At 
Iui  =  32  mWcm-2  the  reflection  switches  back  to  the  first  state  and  a  hysteresis  loop  is 
formed.  As  a  key  result,  the  optical  nonlinearity  is  enhanced  by  an  order  of  magnitude 
as  compared  to  the  results  of  1.4  Wcm-2  in  [2]. 

In  a  further  experiment,  the  threshold  intensity  Ith  is  determined  as  a  function  of  the 
external  voltage  V0.  From  the  plot  in  Fig.  3,  it  is  concluded  that  Ith  is  proportional  to 
1/V0  -f  const.  As  can  be  seen,  optical  bistability  can  be  found  at  optical  intensities  as 
low  as  35  mWcm-2  at  a  bias  voltage  of  115  V.  Consequently,  using  a  spot  diameter  of 
20  pm  an  optical  threshold  power  of  440  nW  is  expected  for  bistability.  Hence,  an  array 
of  100  *  100  elements  can  easily  be  switched  using  a  laser  diode  with  an  optical  power  of 
only  10  mW. 

In  summary,  it  is  shown  that  hybrid  multilayered  semiconductor  structures  exhibit 
huge  optical  nonlinearities  leading  to  potential  applications  for  low  intensity  optical  com¬ 
puting  devices. 
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We  report  on  experimental  results  on  the  dynamical  behaviour  of  n-i-p-i-based  smart 
pixels.  With  switching  energies  of  2.4  fJ/pm2  contrast  ratios  of  4:1  at  1.6  mW  output 
power  were  achieved.  The  opto-optical  gain  is  tunable  from  10 -106. 


Lately  we  have  demonstrated  a  new  smart  pixel  concept  [l]  composing  of  a  photo- 
conductive  switch  with  high  electrical  gain  [2]  and  a  high  contrast  electro-optic  n-i-p-i 
modulator  [3].  A  schematic  picture  of  our  smart  pixel  concept  is  shown  in  figure  1. 
One  advantage  of  such  a  hybrid  concept  is  that  both  elements  (switch  and  modulator) 
can  be  optimized  separately.  Therefore  the  switches  can  have  high  responsivity  and 
high  photoconductive  gain  and  the  modulators  can  have  high  contrast  and  low  insertion 
loss.  Due  to  the  high  photoconductive  gain  of  the  switch  only  a  small  optical  input 
Ptw  is  necessary  to  controll  the  much  larger  output  power  of  the  n-i-p-i  modulator. 
So  pure  opto-optical  logic  devices  with  high  optical  gain  are  realized  requiring  only 
a  dc  voltage  for  operation.  By  choosing  a  suitable  design  of  the  switch  one  can  obtain 
a  high  electro-optical  gain  and  extremely  low  switching  energies.  We  minimized  the 
capacitance  of  the  switch  by  using  a  sophisticated  sample  design  with  a  small  detection 
area  and  a  spatially  separated  large  absorption  area,  which  doesn't  contribute  to  the 
device  capacitance  [4].  In  figure  2  experimental  results  on  the  dynamical  switching 
behaviour  of  the  photoconductive  switch  are  shown.  With  an  optical  power  Psw  of 
880  pW  a  switching  time  in  the  n-layer  of  1.9  ns  was  achieved.  This  corresponds  to 
a  switching  energy  of  1.7  pj  (=  2.4  fJ/pm2  referring  to  the  area  of  this  device 
(0  =  30  pm)  ).  For  this  low  switching  time  the  opto-electrical  gain  is  still  40. 


reference 

diode 


iod 


Figure  1  :  Schematic  picture  showing 
our  smart  pixel  concept  composing  of 
an  opto-  electrical  switch,  a  reference 
diode  and  a  high  contrast  electro-optical 
n-i-p-i  modulator.  In  this  circuit  the 
large  modulator  output  power  PQUt  is 
controlled  by  the  small  input  power 
Pfw  on  t*,e  switch. 
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By  combining  the  photoconductive  switch  with  a  high  contrast  n-i-p-i  modulator,  as 
described  in  Ref.  3,  we  obtained  opto-optical  switching.  Depending  on  wether  the  switch 
is  in  the  high  or  low  resistance  state  the  voltage  (see  figure  l)  drops  either 

across  the  switch  or  the  modulator.  If  the  voltage  drops  across  the  switch,  the  modulator 
is  in  its  transparent  state  and  then  the  optical  output  signal  Pout  is  high.  The  opto- 
optical  switching  behaviour  for  'ious  reference  power  Pref  is  shown  in  figure  3.  In 
this  case  the  optical  output  s  jumps  from  400  pW  to  1.6  mW  corresponding  to 
an  on/off  ratio  of  4:1.  By  ch^ ..amg  the  reference  power  the  switching  point  can  be 
adjusted  externally.  Depending  on  the  switching  power  an  opto-optical  gain  from  10 
to  101 2 3 4 * 6  has  been  achieved. 


Figure  2:  Dynamical  switching  behaviour 
in  the  n-layer  current  I  of  the  photo- 
conductive  switch.  In  this  case  the 
optical  switching  power  P>w  was  880  pW 
and  the  switching  time  tbw  was  1.9  ns, 
corresponding  to  a  switching  energy  of 
1.7  pJ  for  this  0  =  30  pm  device. 


Input  Power  (W) 

Figure  3  :  Opto-optical  switching  with 
the  smart  pixel.  For  various  optical 
power  levels  Pfef  on  the  reference  diode 
the  diagram  shows  the  output  power  of 
the  n-i-p-i  modulator  controlled  by  the 
input  power  on  the  photoconductive 
switch. 
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Abstract 

Theory  and  experiment  on  crosscoupled-mode  bistability  in  a  twin-stripe  laser  is  reported.  The 
laser  has  two  output  ports  complementary  to  each  other,  which  is  analogous  to  a  set-reset 
flipflop  in  electronics. 


Theory 

Fig.l  illustrates  top  views  of  a  twin-stripe  laser  to  show  how  a  cross-coupled  mode  [1] 
buiilds  up  [2-4].  Assume  that  the  light  power  and  the  carriers  are  mainly  in  each  of  the  two 
waveguides  under  the  stripes,  the  cunent  is  uniformly  injected  into  the  stripes,  and  the  cavity 
length  is  near  to  the  coupling  length  of  the  twin  waveguide.  The  explanation  starts  with 
symmetric  light  pattern  at  both  facets.  Suppose  a  fluctuation  which  leads  to  a  slightly 
asymmetric  pattern  near  a  facet  (z=0)  as  shown  in  Fig.  1  (a).  This  field  pattern  changes  during 
one-way  propagation  to  the  other  facet  (z=L),  where  it  becomes  nearly  the  minor  image  (left 
and  right  is  reversed)  of  the  light  pattern  at  z=0,  as  shown  in  Fig.  1(b).  During  this 
propagation,  this  light  consumes  more  carrieres  at  the  right-lower  and  left-upper  regions  of  die 
twin  waveguide  than  at  the  other  regions.  We  should  now  examine  how  the  resultant  diagonal 
carrier  distribution  react  to  the  light  field  in  the  next  step. 

Intuitively,  rare  may  expect  that  the  carrier  distribution  will  enhance  the  light  power  at 
the  regions  with  high  carrier  density,  which  makes  the  light  pattern  return  to  the  symmetric 
shape  because  of  negative  feedback.  On  the  contrary,  however,  it  was  theoretically  shown  that 
the  diagonal  carrier  distribution  enhances  the  light  power  at  the  regions  with  low  carrier 
density  and  makes  the  original  light  pattern  more  asymmetric,  if  the  cavity  length  is  shorter 
(longer)  than  the  coupling  length  for  twin-stripe  lasers  with  low  (high)  inter-stripe  gain  [2-4]. 
This  happenes  because  of  both  lateral  and  longitudinal  resonance,  and  the  light  confinement 
difference  between  the  modes.  The  light  and  the  carrier  distributions  enhance  the  asymmetries 
of  each  other  due  to  the  positive  feedback  and  finally,  both  have  substantially  asymmetric 
patterns  as  illustrated  in  Fig.  1(c).  If  the  original  light  fluctuation  is  such  that  the  left  peak 
(instead  of  the  right  peak  as  shwon  in  Fig.  1(a))  becomes  larger  at  z=0,  then  another  stable  state 
with  mirror  images  of  the  light  and  the  carrier  distributions  shown  in  Fig.  1(c)  is  obtained. 


(a)  Fluctuation  leads  to  (b)  After  one-way  (c)  Final  stable  state 
sflghtly  asymmetric  propagation 
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(d)  Set-reset  flip-flop 


TuC4/184 


( 


Thus,  there  are  two  stable  states  drawn  in  Fig.  1(d)  in  solid  and  broken  curves, 
respectively.  Hie  laser  can  be  switched  from  a  crosscoupled  state  to  the  other  by  light 
injections.  The  operation  is  similar  to  a  set-reset  flip-flop  in  electronics  particularly  in  that  the 
i*s*t  has  two  mutually  complementary  outputs,  while  bistable  semiconductor  lasers  reported  so 
far  have  usually  only  one  output  and  hence  require  an  additional  inverter  to  get  the 
complementary  output  Thus,  twin-sripe  lasers  in  crosscoupled-mode  operation  should  be 
useful  for  optical  switching  and  logic  operation. 

Experiment 

The  lasers  made  for  measurement  has  a  similar  structure  with  that  in  [5].  It  has  an 
ordinary  double  heterostructure  with  a  0.1  pm  thick  GaAs  active  layer  sandwiched  by  two 
AIq  35GaQ  65  As  cladding  layers.  Two  2-pm-wide  stripe  anodes  with  4  pm  spacing  were 
formed  on  4-pm-widc  mesas  made  by  wet  etching.  The  wafer  was  cleaved  to  make  lasers  with 

typical  cavity  length  of  500  pm  which  is  estimated  to  be  near  to  the  coupling  length. 

Fig.  1  (a)  and  (b)  show  near-field  patterns  measured  with  the  left  and  right  currents  fixed 
at  90mA  and  95mA,  respectively.  They  show  the  two  crosscoupled  modes  whose  patterns  are 
the  mirror  images  of  each  other.  When  we  measured  the  field  patterns  with  pulse  (hive, 

Fig.  1(a)  was  obtained  at  some  pulses  and  (b)  was  obtained  at  the  other  pulses.  This  shows 
bistability  between  two  crosscoupled  modes. 

Cavity  length  limitation 

The  cavity  length  should  be  near  to  the  coupling  length  of  the  twin  waveguide  to  give  a 
highly  asymmetric  light  patterns.  To  reduce  the  laser  length  for  crosscoupled  mode  generation, 
the  coupling  between  the  two  waveguides  should  become  stronger.  The  strongest  limit  is  that 
the  two  waveguides  are  combined  into  one. 

Therefore,  the  possibly  shortest  laser  length 
can  be  estimated  by  the  propagation  constant 
difference  between  the  fundamental  and  first 
order  lateral  modes  of  a  single  waveguide. 

For  GaAs- AlGaAs  doublehetero  structure, 
it  is  estimated  as  short  as  a  few  micrometers. 
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Fig.2  Measured  near-field  patterns 
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Abstract 

The  fundamental  modulation  characteristics  of  waveguide  type  frequency  shifter  based 
on  the  rotating  phase  plate  are  demonstrated  on  (110)  GaAs  substrate  as  an  optical 
device  for  multi-dimensional  interconnection  for  the  first  time. 

A  large  number  of  activities  on  2  dimensional  optical  devices  for  free  space 
optical  interconnection  have  been  done  in  order  to  attain  a  large  interconnection 
through-put  for  optical  computing.  In  addition  to  the  space  domain  processing, 
frequency  domain  data  processing  can  realize  the  larger  through-put  by  means  of  a 
multi-dimensional  interconnection  technology. 

Here,  a  frequency  shifter  would  be  one  of  the  most  important  devices,  because  it 
may  realize  the  multi-dimensional  interconnection  as  well  as  the  FDM  crossconnect 
technology  in  communication  network.  Some  frequency(wavelength)  conversion 
devices  such  as  semiconductor  optical  amplifiers  utilizing  FWM  effect  and  bistable 
laser  diodes  are  reported  so  far.  These  have  advantageous  features  of  high  conversion 
efficiencies(>OdB)  and  large  frequency  wavelength)  shift(>10nm).  However,  there  are 
also  drawbacks  of  limits  on  signal  bandwidth(<~GHz)  and  signal  modulation  code(only 
applicable  to  intensity  modulated  signal).  Rotating  half-wave  plate1),  in  principle,  can 
realize  bit-rate  free  and  modulation  code  free  frequency  conversion  since  it  stands  on 
Doppler  shift 

In  this  paper,  we  report  the  fundamental  modulation  characteristics  of  waveguide 
type  frequency  shifter  based  on  the  rotating  phase  plate  for  the  first  time.  We 
successively  demonstrate  parallel  and  perpendicular  phase  modulation  characteristics 
for  the  effective  azimuth  control  of  the  phase  plate. 

Figure  1  shows  the  principle  of  the  frequency  shifting  of  die  rotating  half-wave 

plate.  The  angular  frequency  of  right(left)  circularly  polarized  incident  light(o))  is 
converted  to  to-ojp  on  the  rotating  phase  plate  in  clockwise(couter  clockwise)  due  to 

Doppler  shift  Here,  n>p  denotes  the  angular  frequency  of  die  rotaing  phase  plate.  If  die 
phase  plate  is  a  half-wave  plate,  the  output  light  becomes  the  left(right)  circularly 
polarized  light  with  the  angular  frequency  of  oo-2o}p.  This  process  does  not  depend  on 
the  signal  bit-rate  and  the  modulation  code  of  incident  light  Therfore,  the  optical 
devices  based  on  this  principle  can  be  applicable  to  the  bit-rate  free  and  die  modulation 
code  ftee(not  only  to  intensity  modulated,  but  also  to  FSK  or  PSK  coded  signals) 
frequency  shifter.  The  rotating  phase  plate  could  be  realized  by  applying  die  rotating 
electric  field  to  EO  material  effectively. 

A  schematic  view  and  a  cross  sectional  view  of  die  fabricated  device  are  shown  in 
FigJ2.  The  waveguide  was  formed  by  RIBE,  after  successive  growth  of  AluGaojAs 
lower  dad,  GaAs  guide  and  Al«jGaa7As  upper  clad  layers  on  a  (110)  GaAs  substrate 
being  6*  off  towards  (111 ) A.  All  layers  were  undoped  and  grown  by  MBE.  Because  the 
light  propagation  direction  should  be  in  the  3-fokl  axis  of  GaAs(<l  1 1>),  the  waveguide 

was  designed  to  be  S-shaped,  which  was  inclined  to  (IK))  cleaving  facets  by  35*.  3 
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Schottky  electrodes(Cr/Au),  one  of  them  being  on  the  waveguide  and  two  of  diem 
being  besides  the  waveguide,  were  evaporated  to  apply  the  rotating  electric  field  to  the 
waveguide.  Figures  3  and  4  show  a  photograph  and  an  example  of  the  phase 

modulation  characteristics  erf  the  device,  respectively.  A  DFB  laser  diode(X«1.3pm) 


perpendicular  field  respectively.  As  shown  in  Fig.4,  obtained  V*  values  for  the  parallel 
and  perpendicular  field  were  10V  and  13V,  respectively,  "njereforc,  by  applying 
timiMiMtai  electric  fields  to  the  electrodes  with  adequate  phases,  the  effective  rotating 
azimuth  erf  die  phase  plate  can  be  induced  as  the  waveguide  frequency  shifter  based  on 
Doppler  shift  These  V*  values  are  so  high  to  attain  the  large  frequency  shift  because  of 
neccessity  of  the  high  voltage  and  high  frequency  electric  driver.  However,  by 
introducing  large  EO  effect  by  quantum  size  effect  such  as  QCSE2),  we  beleive  the 
issue  can  be  overcome. 

In  conclusion,  the  fundamental  modulation  characteristics  of  waveguide  type 
frequency  shifter  based  on  the  rotating  phase  plate  have  been  repoted  as  an  optical 
device  for  multi-dimensional  interconnection  for  the  first  time.  Demonstrated  parallel 
and  perpendicular  phase  modulation  characteristics  for  the  effective  azimuth  rotation  of 
the  phase  plate  indicated  the  possibility  of  the  bit-rate  and  modulation  code  free 
frequency  shifter  in  practical. 
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Afegiiast 

The  asymmetric  dragging  interaction  between  three-dimensional  optical  solitons  may  allow  cas- 
cadable,  phase-insensitive,  NOR  gates  with  gain  to  be  implemented  at  ultrahigh  speeds  in  mas¬ 
sively  parallel  three-dimensional  bit-level  systolic-array  architectures. 


In  media  that  have  both  a  self-focusing  non¬ 
linearity  (n2  >  0)  and  negative  group  velocity 
dispersion,  a  pulse  can  collapse  in  both  space 
and  time  forming  a  stable  “light  bullet”  in  3+1 
dimensions .l1-2)  These  collapsed  optical  pulses 
have  high  peak  powers  but  small  total  ener¬ 
gies,  making  them  attractive  for  nonlinear  op¬ 
tics  applications.  These  three  dimensional  soli¬ 
tons  can  be  made  to  interact  to  produce  a  logic 
gate  in  the  same  way  that  one-dimensional  spa¬ 
tial  (or  temporal)  solitons  do.  We  are  investi¬ 
gating  the  asymmetric  interaction  between  two 
orthogonally  polarized  solitons  brought  into  co¬ 
incidence  both  spatially  and  temporally  at  the 
boundary  of  a  nonlinear  medium  and  propa¬ 
gating  at  slightly  different  angles.  This  geom¬ 
etry  permits  a  weak  signal  to  drag  a  strong 
pump  by  well  over  a  beam  width,  allowing  them 
both  to  be  blocked  by  a  spatial  aperture^3! 
This  results  in  a  phase  insensitive  inverter  with 
gain  that  can  be  cascaded  to  implement  a  high 
contrast  NOR  gate  that  transmits  an  uncor¬ 
rupted  pump  through  the  aperture  if  the  sig¬ 
nals  are  not  present,  and  drags  the  pump  out 
of  the  spatial  aperture  so  that  it  is  blocked  if 
they  are  present.  Simultaneous  multidimen¬ 
sional  dragging  by  two  signals  is  possible  for 
light  bullets,  allowing  the  single  stage  imple¬ 
mentation  of  NOR  gates.  A  beam  propaga¬ 
tion  simulation  of  an  asymmetric  light-bullet 
dragging  interaction  is  shown  in  Figure  1. 
This  shows  a  dragging  of  a  6/imx6/imxll/mi 
3.3pJ  pump  (/p=140MW/cm2)  by  a  .7pJ  sig¬ 
nal  (J,=20MW/cm2)  in  about  1.5mm  of  prop¬ 
agation  distance  using  a  rather  large  saturating 
nonlinearity  of  nf  =  10-16m2/V2. 

In  order  to  optimize  these  light  bullet  drag- 
png  gates  without  the  huge  computational  over¬ 
head  of  the  3+1  dimensional  beam  propagation 


simulations,  we  are  examining  the  properties 
of  ID  asymmetric  spatial  soliton  dragging  in¬ 
teractions,  as  illustrated  in  Figure  2.  As  an 
example  we  illustrate  the  output  contrast  ratio 
(wrt  the  fundamental  soliton  power)  in  a  two- 
level  system  saturating  nonlinear  medium  (in 
this  case  I3at  is  4  times  the  nonsaturated  fun¬ 
damental  soliton  peak  intensity),  as  a  function 
of  interaction  angle,  propagation  distance  and 
pump-to-signal  beam  ratio  in  Figure  3. 

Layered  sandwiches  of  nonlinear  media, 
aperture  arrays  and  linear  media  suggest  the 
possibilty  of  computing  in  3-D  with  asymmet¬ 
ric  light  bullet  dragging  gates.  But  getting  the 
signals  and  pumps  to  the  desired  interaction 
sites  without  disruption  by  unwanted  signals 
may  be  difficult.  It  may  be  necessary  to  systoli- 
size  at  the  bit  level  in  3-dimensions  by  pulsing 
the  clock  pumps  and  signals  so  that  they  pass 
through  each  other  in  intervening  layers  of  lin¬ 
ear  media  until  arriving  at  the  desired  dragging 
logic  site,  and  to  program  the  functionality  of 
the  array  of  logic  gates  by  the  presence  and  ab¬ 
sence  of  clock  pumps  in  the  space-time  lattice 
of  possible  light  bullet  locations.  An  example 
of  a  1-D  spatial  soliton  dragging  logic  cascaded 
majority  logic  circuit  is  illustrated  in  Figure 
4,  demonstrating  that  more  complex  functions 
than  simple  NOR  gates  can  be  implemented 
without  intervening  interconnections. 

Ultrafast,  massively  parallel,  low  latency, 
all  optical  NOR  gates  with  gain,  cascadabil- 
ity,  input-output  isolation,  and  phase  insensi¬ 
tivity  have  been  proposed,  numerically  demon¬ 
strated  and  parametrically  optimized.  This  op¬ 
tical  switching  interaction  opens  up  new  archi¬ 
tectural  possibilities  for  computing  in  3+1  di¬ 
mensions  that  may  allow  the  realization  of  vol¬ 
ume  parallel  digital  optical  computers. 
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Figure  1:  Light  bullet  dragging  by  more  than 
a  beam  width.  The  two  orthogonally  polarized 
light  bullets  are  represented  with  a  mesh  and 
a  solid  surface  thresholded  at  .1%  of  the  peak 
pump  intensity,  and  are  propagating  vertically 
with  an  initial  1  degree  angle. 


Figure  3:  Output  contrast  versus  interaction 
angle  and  pump-to-signal  ratio  for  a  propaga¬ 
tion  distance  of  5  confocal  distances,  and  con¬ 
trast  versus  interaction  angle  and  propagation 
distance  for  a  beam  ratio  of  5. 


|  pump,  no  mqnat  |  pump+aignol  |  no  pump  | 


Figure  2:  Asymmetric  1-D  spatial  soliton  drag¬ 
ging,  showing  pump  output  through  the  aper¬ 
ture  as  an  inversion  of  the  signal  input. 


Figure  4:  Cascaded  3-level  simulation  of  ma¬ 
jority  logic  circuit  showing  that  the  first  pump 
implements  a  NOR  that  is  blocked  by  either 
signal,  and  the  second  pimp  is  only  blocked 
when  2  or  more  off-axis  signals  are  present. 
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ABSTRACT 

Dramatic  improvements  in  microprocessor  price  /performance  have  challenged  the  supercomputer 
designer  to  achieve  high  levels  of  system  performance  by  interconnecting  hundreds  or  even  thousands  of 
processors.  But  to  achieve  high  sustained  computational  rates  it  is  imperative  that  designs  be 
balanced  with  respect  to  processor  speed,  local  and  global  memory  bandwidth,  input /output 
capability,  and  interprocessor  synchronization  primitives.  This  talk  will  discuss  these  criteria  using 
examples  from  the  recently  announced  CRAY  T3D  system. 


It  has  been  more  than  twenty  years  since  the  supercomputer  industry  was  more  or  less  defined  by 
the  CRAY-1.  Very  significant  technological  changes  have  occurred  since  then.  Most  compelling  are  the 
great  advances  in  integrated  circuit  density  and  speed.  These  advances  have  spawned  multiple 
generations  of  microprocessors  and  inexpensive  semiconductor  memory.  In  fact,  the  latest 
microprocessors  now  challenge  the  performance  claims  of  the  original  CRAY-1  but  at  small  fraction 
of  the  cost.  There  is  naturally  a  strong  incentive  to  try  to  achieve  high  processing  rates  across  large 
numbers  of  these  relatively  inexpensive  microprocessors.  However,  realizing  the  aggregate  potential 
performance  of  such  a  system  has  proven  to  be  quite  challenging. 

First,  the  problem  and  selected  algorithm  must  be  highly  parallel.  As  is  well  known,  Amdahl's  law 
observes  that  for  practical  problems,  the  total  execution  time  will  consist  of  a  parallel  portion 
parceled  out  to  a  collection  of  processors  plus  a  serial  portion.  If  large  numbers  of  processors  are 
frequently  forced  to  wait  for  the  computation  of  data  coming  from  a  single  processor  in  the  system 
plus  the  additional  time  required  to  distribute  the  data,  the  actual  benefit  of  the  large  numbers  of 
processors  will  be  reduced.  At  some  point,  adding  more  processors  is  no  longer  cost  effective  in 
reducing  the  time  to  solution.  Even  for  a  problem  which  is  95%  parallel,  one  can  not  expect  speedups 
beyond  20  for  even  an  arbitrarily  lar^e  number  of  processors.  Massively  parallel  (or  MPP)  systems  of 
hundreds  of  processors  require  problems  that  are  more  than  99%  parallel. 

Second,  the  problem  should  exhibit  high  algorithmic  efficiency.  Each  calculation  should  effectively 
move  the  calculations  toward  the  final  solution.  A  high  sustained  floating  point  operation  rate  is 
diminished  in  value  if  the  algorithm  demands  many  more  intermediate  results  than  a  competing 
algorithmic  choice.  Such  false  achievements  that  do  not  contribute  to  reducing  the  time-to- solution 
have  caused  a  great  deal  of  confusion  in  the  marketing  of  competing  MPP  systems. 

Third,  it  must  be  possible  to  efficiently  move  dependent  data  to  requesting  processors.  This 
communication  overhead  can  have  the  same  effect  as  the  serial  portion  of  the  computation. 
Algorithms  of  choice  require  only  modest  amounts  of  interprocessor  communication  or  else  allow 
mechanisms  in  the  hardware  to  effectively  make  the  latency  transparent  when  coupled  with 
software  prefetching  techniques. 

There  are  two  distinct  architectural  choices  for  which  cooperation  among  large  numbers  of  processors 
might  be  enforced.  The  Single  Instruction  Multiple  Data  (or  SIMD)  model  causes  each  processor  to 
operate  in  lock  step  with  all  other  processors  in  the  partition  but  with  different, r  arallel  data  sets. 
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The  Multiple  Instruction  Multiple  Date  (or  MIMD)  model  allow*  each  processor  to  follow  its  own 
control  flow,  executing  from  distinctly  different  instruction  streams. 

SIMD  designs  are  a  good  match  for  data  parallel  programming  models  and  can  be  somewhat 
simpler  to  program.  But  these  suffer  from  inefficiencies  when,  for  example,  sparse  matrix  data  is 
being  manipulated  in  rather  naive  ways  with  lots  of  multiplications  or  additions  of  zero  operands. 
Although  the  operation  count  may  be  high,  the  algorithmic  efficiency  is  low.  MIMD  systems  provide 
many  more  degrees  of  freedom  with,  conceptually,  each  processor  operating  from  distinctly  different, 
optimized  control  stores.  Whereas  synchronization  is  implicit  for  the  SIMD  model  with  all  parallel 
calculations  occurring  simultaneously,  it  is  necessary  in  the  MIMD  case  to  manage  more  explicitly 
the  limited  interaction  among  processors.  Although  somewhat  more  difficult  to  program,  the 
opportunity  for  increased  hardware  utilization  has  caused  a  shift  in  essentially  all  recent  MPP 
designs  toward  the  MIMD  domain. 

Uniform  access  memory  sub-system  have  worked  effectively  for  vector  supercomputers  but  do  not 
scale  well  to  large  number  of  processors  because  the  required  number  of  independent  data  paths 
grows  exponentially.  The  non-uniform  p  sically  distributed  memory  model  provides  a  more  scalable 
solution  with  each  processor  located  physically  nearby  a  set  of  memory  devices.  The  processors  can 
then  have  preferential  access  (that  is,  lower  latency  and/or  higher  bandwidth)  to  the  nearby 
memory.  The  non-uniform  memory  access  model  can  provide  a  much  nr”-e  cost  effective  solution  for 
parallel  algorithms  that  emphasize  a  high  degree  of  local  computation  and  only  moderate  amounts 
of  global  data  movement.  Whereas  the  high  degree  of  connectivity  in  the  uniform  access  memory 
system  may  actually  be  under  utilized  a  significant  part  of  the  time. 

Whether  the  memory  is  physically  shared  or  distributed,  there  is  another  significant  architectural 
choice:  the  logical  addressing  mechanism  among  processors  determining  whether  the  system  is  a 
multi-computer  or  a  true  multi-processor.  Many  MPP  systems  are  designed  as  multi-computers  with 
interprocessor  communication  depending  upon  explicit  I/O  based  requests.  Data  moving  from  one 
processor  to  another  must  pass  through  the  I/O  ports  of  the  processors.  This  usually  requires  the 
use  of  processor  interrupt  mechanisms  and  even  intervention  by  the  operating  system.  The  target 
processor  must  deer,  ie  or  interpret  via  software  techniques  the  address  tag  and  the  purpose  of  the 
message  to  determine  the  exact  source  or  destination  for  the  payload.  This  results  in  significant 
increases  in  latency  for  the  interprocesaor  communication  process.  A  multi-processor  on  the  other 
hand  provides  direct  paths  to  each  memory  location  within  the  entire  processor-pool  partition. 
Addressing  is  managed  through  a  single  global  address  space.  On  the  target  end  of  the  transfer,  the 
data  communication  mechanism  does  not  interrupt  the  processor,  but  proceeds  directly  via 
hardware  support  to  that  processor's  local  memory. 

Interprocessor  synchronization  must  be  fast  and  scalable.  Typical  RISC  microprocessor  designs 
anticipate  only  very  moderate  numbers  of  processor  working  together  within  a  workstation. 
Techniques  used  are  often  bus-based,  such  as  the  popular  bus  snooping  protocol  technique.  Due  to 
very  real  physical  constraints,  these  schemes  do  not  scale  well  beyond  about  a  dozen  processors. 
Software  and  I/O  based  synchronization  protocols  used  in  workstation  cluster  organizations  may 
scale  to  large  numbers  but  fall  far  short  of  the  goal  for  providing  single  microsecond  or  less  global 
synchronization  latency.  Much  more  effective  techniques  providing  barrier,  atomic  swap,  pre-fetch, 
and  message  queuing  approaches  are  possible  and  provide  for  more  efficient  hardware  utilization 
within  critical  regions  of  code  execution. 
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ABSTRACT 

A  mob-connected  bos  networking  topology  is  proposed  for  implementing  the  three-stage 
Clos  network  and  is  experimentally  demonstrated  nsfeg  a  WDMA  technology. 


We  consider  networks  for  interconnecting  parallel  computers  that  are  parameterized  by 
their  diameters  (number  of  switching  stages)  and  degree  (fan-out)  of  switches  which  are  used  to 
build  die  network.  Clos  used  switch  fan-out  to  describe  a  family  of  networks  that  range  from 
the  complete  network  that  has  maximum  fan-out  and  minimum  switching  diameter  to  the 
"Bcnes"  network  with  neatly  minimum  fan-out  and  larger  networking  diameter.  Electronic 
networks  tend  towards  die  small  fan-out/large  diameter  end  of  the  spectrum  due  to  limitations 
on  the  fan-out  of  electronic  switches.  The  fan-out  of  systems  of  optical  switches  is  anticipated 
to  be  significandy  higher  than  electronic  switches.  Thus,  it  has  been  suggested  that  optics  can 
be  used  to  build  the  complete  network  which  is  the  most  desirable  of  Qos's  family  of 
networks.  Even  optics,  however,  has  practical  limits  on  die  fan-out  of  switches  in  any  given 
configuration.  Thus,  in  this  paper,  we  describe  an  implementation  of  a  network  with 
significandy  smaller  degree  and  slightly  larger  network  diameter  than  die  complete  network. 
The  network  that  we  implement  is  called  the  mesh-connected  bus  network.  We  remark, 
however,  that  it  is  essentially  an  implementation  of  the  second  most  desirable  network  in  the 
family  of  networks  defined  by  Clos,  which  is  widely  referred  to  as  the  Clos  network. 


‘  wavelength 
channel  codes 


Hg.1:  A 16  node  WDM  MCB  netwodc  kyoot 


The  Clos  network  is  often  depicted  in  its  linear 
node  distribution  format.  We  remap  this  ID  format 
to  a  2D  linking  topology  (see  Fig.l  for  N«16 
nodes)  which  can  be  referred  to  as  the  mesh- 
connected  bus  (MCB)  topology  where  the  unee 
switching  stages  are  embedded  into  a  permutation 
along  a  row,  followed  by  along  a  column,  and 
finally  along  a  row  in  a  grid  structured  2D  node 
array.  It  can  be  shown  depending  on  the  number  of 
switches  used  in  each  row  and  column,  die  MCB 
offers  at  least  a  rearran gable  non-blocking 
switching  environment  We  also  summarize  other 
important  embedding,  off-line  as  well  as  on-line 
communication  features  of  die  MCB  network. 


To  optically  implement  the  MCB  interconnect  we  propose  to  use  the  wavelength- 
division  multiple  access  (WDMA)  concept  which  is  commonly  used  in  fiber  communication 
community  rfowever,  in  our  approach,  free- space  optical  components  such  as  cylindrical 
optical  lenses,  minors,  or  ID  gratings,  etc.  for  rooting  optical  signals  are  adopted  [1].  We  will 
snow  various  ways  of  realizing  die  proposed  optical  WDMA  MCB  network  and  analyze  the 
copqpondmg  fundamental  and  technological  limits. 
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The  proposed  system  concepts  are  experimentally  verified  through  routing  both  low  (10 
MHz)  and  high  (1.25  GHz)  bandwidth  optical  signals  (see  Fig.2  for  our  system  connection 
diagram)  [2].  In  the  low  bandwidth  situation,  some  base-band  video,  audio  and  RS-232  data 


wdma  tollable  filter 
HgJ2:  Experimental  letup  for  the  3-(tage  WDMA  MCB  interconnect. 


are  multiplexed  to  rill  up  die  10  MHz  bandwidth.  Conventional  Fabry-Ferot  lasers  operating  in 
the  range  of  1 290  -  1 340  nm  were  used.  Our  power  measurement  confirmed  that  a  fan-out  to  as 
many  as  36  channels  could  be  established  along  each  bus  making  a  network  linking  1,296 
nodes  power-wise  possible.  Through  3-stages  of  riee-space  routing  and  associated  electric -to- 
optical  and  optical-to-clectric  conversions,  an  overall  49  dB  signal-to-noise  ratio  far  video 
signal  reception  was  still  maintained.  The  images  shown  in  the  four  quadrants  of  Fig.  3  are  the 
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Rj.4:  Bit-cnw-me  test  remits  of  1.25  GWi 
bus  link. 


original  camera  image  of  our  setup,  and  the  received  images  at  the  first  through  the  last  stages  of 
our  network.  Our  high  bandwidth  experiment  involved  using  high-quality  DFB  lasers  operating 
around  1310  nm.  The  corresponding  optical  system  power  measurement  illustrates  that  at  the 
transmission  rate  of  1.25  Gb/s,  our  WDM  bus  array  can  accommodate  16  channels  per  bus  or 
equivalently  256  nodes  in  die  network  with  a  guaranteed  receiving  bit-etror-iate  of  lower  than  2 
x  10*13  (see  Fig.  4  for  the  bit-error  rate  measurement  results). 

[1]  Y.  Li,  A.  W.  Lohmann,  and  S.  B.  Rao  Tree-space  optical  mesh-connected  bus  networks 
using  wavelength-division  multiple  access"  AppL  Opt  32  (1993)  6425-6437. 

[2]  Y.  Li,  A.  W.  Lohmann,  Z.  G.  Pan,  S.  B.  Rao,  L  Redmond  and  T.  Wang,  "Optical 
multiple-access  mesh-connected  bus  interconnects,"  submitted  for  publication. 
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Abstract: 

We  discuss  the  issues  involved  in  building  demonstration  systems  integrating  GaAs  FET-SEED  smart  pixels, 
computer  generated  holograms,  2-D  fiber  bundles,  high-power  lasers,  high  resolution  optics,  and  novel  optome¬ 
chanical  packaging.  A  prototype  5  stage,  32x16  fabric  operating  at  153Mb^  is  described. 

Summary: 

An  exciting  new  trend  in  free-space  photonic  switching  and  computing  systems  is  the  use  of  smart  pixel  device 
arrays  which  combine  the  processing  power  of  electronics  with  the  interconnection  and  communication  advan¬ 
tages  of  optics.  A  recent  system  prototype  (Fig.  1)  implements  a  32-input,  16-output  multi-stage  switching  fabric 
using  $  stages  of  4x4  FET-SEED  “smart  pixel”  (2,1,1)  node  arrays,  and  Banyan  interconnections.!  1 ,2]  The  first 
experiments  with  this  system  achieved  50  Mb/s  operation  of  the  5  stages  with  15  active  inputs  and  outputs,  and 
155  Mb/s  operation  of  2  stages  with  8  of  the  15  active  nodes  functioning  correctly  along  their  “straight”  routing 
paths.  This  talk  presents  the  critical  design  and  testing  issues  for  this  system,  including  recent  modifications 
enabling  operation  of  5  stages  at  155  Mb/s,  with  32  active  inputs  and  16  active  outputs. 

Each  stage’s  optical  power  supply  combined  a  high  power  semiconductor  laser  diode  with  an  external  cavity  grat¬ 
ing  for  frequency  stabilization  and  a  computer  generated  binary  phase  grating  (BPG)  to  form  the  non-uniformly 
space  array  of  equal  power  spots  required  to  address  the  FET-SEED  modulators.  The  lasers  [3]  supply  60  mW  of 
optical  power  at  850  V-O.lnm.  The  BPGs  have  a  measured  efficiency  of  73%  (neglecting  Fresnel  losses)  and  a 
power  uniformity  of  93%.  A  custom  beam  combination  assembly,  consisting  of  2  polarizing  beam  splitters  is  used 
to  combine  the  power  supply  and  signal  beams  onto  the  node  array  by  creating  input  and  output  ports  for  the  sin¬ 
gle  f/1.5  Fourier  transform  lens  at  each  stage.  [4]  Each  FET-SEED  switching  node  consists  of  a  differential  optical 
receiver,  a  control  memory  to  store  a  routing  bit  extracted  from  the  data  stream,  and  a  multiplexer/modulator 
driver.  The  first  stage  array  has  11x11  pm2  detector  windows,  and  2  differential  optical  receivers  per  node  sepa¬ 
rated  by  120  pm.  Tested  individually,  the  node  circuits  operated  up  to  400  Mb/s  with  80  0  of  incident  optical  sig¬ 
nal  energy.  The  interconnections  between  stages  were  initial  performed  by  1x3  BPGs  between  stages.  System 
inputs  are  generated  by  8  lOnm  laser  diodes  attached  the  input  fibers,  which  are  arranged  in  an  8x4  bundle  on  a 
500  pm  pitch.  This  fiber  matrix  is  imaged  onto  the  first  node  array  through  a  dichroic  beam  combination  system 
similar  to  the  PBS  assemblies  of  the  other  stages,  except  that  PBS]  has  X.  /4  retarders  and  dichroic  mirrors  (trans¬ 
mit  850nm,  reflect  810nm)  attached  so  drat  the  unpolarized  light  from  the  input  fibers  is  directed  onto  the  first 
node  array.  A  custom  mounting  plate  similar  to  our  previous  system  [5]  is  used  to  mount  and  align  all  system 
components.  The  laser  units,  PBS  assemblies,  and  the  input  and  output  fiber  bundles  are  mounted  on  kinemati¬ 
cally  registered  sub-mounts  for  easy  assembly  and  replacement. 

Several  system  modifications  have  been  made  for  the  second  experiment  Design  and  processing  modifications 
have  increased  FET  currents  and  uniformity.  The  areas  of  the  modulators  have  been  increased  to  10x10  pm2  from 
7x7  pm2  (detector  areas  remain  at  7x7  pm2),  and  the  node  pitch  increased  from  210  pm  to  240  pm.  The  Bsmyan 
interconnections  between  stages  are  realized  by  metallization  on  the  node  arrays  rather  than  by  binary  phase  grat¬ 
ings  (BPGs),  providing  4  times  more  optical  signal  power  at  each  FET-SEED  receiver.  The  use  of  non- separable 
BPGs  rather  than  multi-level  suppressed  order  phase  gratings  for  spot  array  generation  provided  increased  light 
efficiency.  Population  of  all  32  input  fibers  enabled  testing  of  all  512  paths  through  the  fabric.  The  use  of  1300  nm 
SM  fibers  in  an  actively  aligned  fiber  bundle  eased  the  alignment  of  the  data  input  lasers,  but  since  they  support  4 
modes  at  810nm,  a  -2.2dB  modal  noise  tradeoff  was  mrodoced.  Fabrication  errors  in  the  input  bean  splitter 
assembly  have  been  corrected  by  the  addition  of  a  birefringent  wedge  arrangement 
An  automated  call-load  generation  system  was  developed  to  test  all  512  paths  through  the  5-stage  network.  At 
155  Mb/s,  45%  of  the  paths  were  functional  due  to  3  non-functional  pixels  and  1  low-powered  input  fiber.  The 
system  maintained  this  functionality  within  our  lab  (+/-1°  C,  Newport  table)  for  over  8  weeks.  The  development  of 
this  prototype  would  not  have  been  possible  without  the  combined  efforts  of  the  authors  listed  in  the  references. 
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Demonstration  of  optically  controlled  data  switching  using 
quantum  well  bistable  devices  and  modulators 

P.  Koppa,  P.  Chavd  (Institut  d'Optique-CNRS,  BP.  147  -  91403  Orsay  Cedex,  France) 
JJL.Oudar,  R.  Kuszdewkz  (Fiance  Telecom  ,  CNET  Paris-B,  BP.  107  -  9222S  Bagneux 
Cedex,  Fiance) 

JJ»h.  Schnell,  J.P.  Pocholle  (Thomson  CSF.  LCR,  Domaine  de  Corbeville,  91404  Orsay, 
France) 

Abstract: 

Experimental  results  on  a  64  channel  free-space  photonic  switching  system  are  presented. 
Two  control  schemes  are  demonstrated  :  direct  optical  addressing  with  potential  signal 
amplification  and  self-routing  operation  acting  cm  data  packets. 


We  present  a  1  to  64  free-space  optical  switch  which  can  be  operated  both  in  packet  switching  and 
circuit  switching  environments.  The  system  associates  a  spatial  light  modulator  0)  (SLM)  and  an 
optical  bistable  device  array,  both  based  on  GaAs/GaAlAs  Multiple  Quantum  Well  (MQW) 
structures.  The  SLM  is  an  8x8  array  of  individually  addressable  transmission  type  electro¬ 
absorption  modulators.  The  optical  bistable  device  consists  of  a  non-linear  Fabry-Perot  etalon. 
The  8x8  bistable  pixels  are  simply  defined  by  separated  light  spots  incident  on  the  same  cavity. 
When  maintained  in  the  bistable  region  of  operation  by  a  holding  beam  array,  the  devices  exhibit 
a  two  state  memory  effect  for  about  1  ms  (fig.  1/a).  With  a  slight  modification  of  the  detuning 
from  the  Fabry-Perot  resonance,  this  hysteresis  loop  can  be  transformed  into  a  simple 
thresholding  curve  (fig.  1/b). 


Fig.l.  :  Bistable  device  response  curves,  a.:  bistab te, 
b.:  simple  thresholding  operation 


In  the  packet  switching  mode,  self¬ 
routing  commutation  is  implemented  by 
optical  decoding  of  the  header  address 
preceding  each  date  packet.  This  address 
decoding  (3)  consists  in  fanning-out  the 
signal  into  64  channels,  comparing  bit  by  bit 
the  address  of  the  packet  to  the  address  of 
each  output  channel,  and  switching  off  all 
channels  where  the  identification  of  one  or 
more  bits  fails.  A  two  bit  coding  technique  is 
used  to  code  the  binary  addresses  :  each  bit  is 
followed  by  its  complementary  value,  so  bit 
(me  is  coded  by  a  high-low  sequence,  bit  zero 
by  a  low-high.  By  the  use  of  this  code, 
address  comparison  becomes  a  simple  bitwise 
multiplication  of  the  packet  address  by  the 
inversely  coded  channel  address,  introduced 
as  the  modulated  transmission  of  the  SLM 
pixel.  Any  mismatch  between  the  two 
addresses  is  turned  into  the  transmission  of  a 
high  level  address  pulse  by  the  modulator 
element. 


This  pulse  then  switches  down  the  corresponding  bistable  element  in  its  blocking  state  (point  B  on 
fig  1/a),  from  its  initial  reflective  state  (point  A).  Consequently,  the  only  bistable  element  staying 
in  reflective  state  will  be  that  of  the  destination  channel,  where  address  matching  is  perfect.  After 
the  date  packet  has  passed  through  this  channel,  all  bistable  devices  are  reset  to  the  initial  high 
reflection  state  to  receive  the  next  packet 
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Figure  3  shows  output  signals  obtained  by  single  channel  operation  of  the  system.  The  first 
oscilloscope  trace  shows  a  packet  addressed  to  the  observed  output  :  there  is  no  high  level  pulse 
due  to  address  mismatch,  so  data  are  transmitted  by  the  system.  The  last  three  traces  represent 
packets  with  different  addresses  which  do  not  match  the  address  of  this  output.  We  can  distinguish 
the  high  level  address  pulse  which  immediately  switches  off  the  bistable  device.  The  bistable 
devices  used  in  this  experiment  have  a  contrast  ratio  of  about  2:1,  so  theoretically  blocked  packets 
are  attenuated,  but  still  visible  at  the  output  The  data  rate  in  this  experiment  is  20  Mbit/s  and  total 
input  intensity  is  about  IS  mW.  System  operation  is  demonstrated  for  packet  lengths  up  to  8  kbits, 
with  a  typical  error  rate  of  1%. 

In  circuit  switching  mode  holding  beams  are  directly  used  to  control  bistable  device 
reflectivity.  To  achieve  this  functionality,  holding  and  signal  beams  are  interchanged  compared  to 
figure  2,  so  that  we  can  modulate  independently  each  holding  beam.  (  denoted  as  control  beams 
in  this  case.  )  Bistable  devices  are  set  to  non-bistable  thresholding  mode  (fig.  1/b). 


Fig.3  :  Output  signals  of  the  self-routing  operation  Fig.  4  : Output  obtained  by  direct  addressing 


Figure  4  shows  one  channel  operation  in  that  mode  :  when  the  control  beam  is  off  (point  A  of 
figure  1/b),  the  bistable  device  is  reflective,  so  the  signal  is  routed  to  the  output ;  when  the  control 
beam  is  on,  the  device  is  in  the  low  reflection  state,  so  the  signal  beam  is  mostly  absorbed.  In  this 
operation  mode  the  contrast  ratio  can  be  improved  up  to  a  value  of  4:1,  while  the  total  input 
intensity  is  lower  than  5  mW.  The  switching  threshold  is  about  1  mW,  as  opposed  to  3.5  mW  in  the 
bistable  mode  . 

This  operation  mode  can  be  transformed  into  active  switching,  by  setting  the  control  beam 
intensity  to  point  C  of  figure  1/b  (i.e.  very  close  to  the  switching  threshold  ).  If  the  additional 
signal  intensity  exceeds  the  threshold,  the  device  is  switched  by  each  data  bit  change,  so 
modulation  of  the  signal  is  transmitted  onto  the  control  beam.  As  the  control  beam  is  in  general 
more  intense  then  the  signal  beam,  data  amplification  can  be  obtained.  The  actual  gain  depends 
on  device  response  curve  and  signal  to  noise  ratio.  In  our  system  an  amplification  factor  of  nearly 
two  was  obtained,  but  with  other  bistable  devices  higher  gains  have  also  been  demonstrated^4). 
This  operation  scheme  presents  the  advantages  of  data  reshaping  and  amplification,  but  it  requires 
data-rate  switching  of  the  bistable  element,  limiting  the  data  rate  by  its  operation  speed.  (Rise-  and 
fall-times  of  the  order  of  10  ns  were  observed  for  our  actual  devices,  with  3mW  input  on  200  pm.) 
In  self-routing  operation  and  passive  circuit  switching  modes,  the  data-rate  is  not  limited  by  the 
device  speed,  since  the  set-up  is  transparent  to  data  bits. 

In  conclusion,  single  channel  operation  of  the  1  to  64  free-space  optoelectronic  switch  was 
presented  both  in  packet  switching  and  in  circuit  switching  mode.  Further  results  on  parallel 
operation  of  the  system  will  be  given,  as  well  as  a  study  of  system  limitations  due  to  crosstalk, 
uniformity  and  noise  effects. 

This  work  is  supported  by  the  French  Ministry  of  Research  under  the  project  MOTS/OSTI  91. 
References : 

1.  J.  Ph.  Schnell,  J.  Raffy,  J.  P.  Pocholle,  A.  Delboulbe,  G.  Dutrey,  J.  Lehoux,  M.  Werner, 

M.  Papuchon,  J.  P.  Huignard  ;  Technical  Digest  of  Spatial  Light  Modulators  1990,  Vol.14, 
pp  60-64 

2.  J.  L.  Oudar  ,  R.  Kuszelewicz  ,  B.  Sfez,  D.  Pellat,  R.  Azoulay;  Superlattices  and  Microstructures, 

12,  p.89  (1992) 

3.  P.Chavel,  P.Koppa,  J.Taboury,  J.L.Oudar  ,  R.  Kuszelcvicz,  J  .Ph.  Schnell,  J.  Raffy, 

J.P.  Pocholle,  M.  Papuchon  ;  Topical  Meeting  on  Spatial  Light  Modulators,  Palm 
Springs,  California,  1993,  OS  A  1993  Technical  Digest  Series  Volume  6,  p.81 

4.  B.  Sfez,  J.  L.  Oudar,  R.  Kuszelewicz,  J.GMichel,  R.  Azoulay  ;  Appl.  Phys.  Lett.  52.  p.1849 

(1990) 


WB2/197 


Racoaflgarablc  AirMteftaitJmd  on  Selective  Enabling  of  Mkrobsers 
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ABSTRACT 

We  report  on  a  reconfigunble  architecture  that  uses  a  static  free- space  optical  interconnect.  A  two-dimen¬ 
sional  array  of  microlasers  controls  a  two-dimensional  array  of  S-SEED  optical  logic  modulators  in  a  cas¬ 
caded  system. 

1.  INTRODUCTION 

In  previous  experiments  at  Rutgers  University,  a  property  known  as  functional  locality  [1]  was  observed  for 
a  mix  of  programs  executing  on  a  SPARC  processor.  The  concept  behind  functional  locality  is  that  only  a 
fraction  of  the  hardware  in  a  processor  is  used  for  a  given  interval  of  time,  and  that  there  is  repetition  within 
that  interval.  If  we  consider  implementing  just  1 0%  of  die  SPARC  instruction  set,  then  we  observe  a  hit  ratio 
on  the  order  of  80%.  That  is,  80%  of  the  instructions  are  executed  from  only  1 0%  of  the  code  over  a  lengthy 
period  of  time. 

As  a  result  of  this  finding,  the  development  of  a  function  cache  [1]  is  proposed  in  which  the  most  recently 
used  hardware  is  kept  in  the  form  of  interconnection  patterns  in  a  high  speed  section  of  a  processor.  A 
processor  that  supports  functional  locality  must  provide  for  gate-level  reconfiguration,  which  motivates  our 
development  of  an  optically  reconfigunble  architecture. 

A  model  for  a  reconfigunble  optical  architecture  that  was  demonstrated  in  die  Photonics  Crater  at  Rome 
Laboratory  in  September  1993  is  illustrated  in  Figure  1.  The  architecture  consists  of  an  army  of  active 
optical  logic  devices  (S-SEEDs)  that  are  interconnected  in  free  space.  The  architecture  is  customized  by 
selectively  enabling  channels  with  electrically  controlled  vertical  cavity  surface  emitting  lasers  (VCSELs) 
that  provide  optical  setup  power  for  the  S-SEEDs,  and  can  ultimately  provide  readout  power  for  the  S- 
SEEDs  as  well.  The  firce-space  interconnect  has  a  regular  pattern  implemented  as  a  simple  split-and-shift. 


2.  DISCUSSION 

A  smart  pixel  is  an  electronic  component  with  optical  input  and  output  ports.  The  complexity  of  a  smart 
pixel  is  typically  on  the  order  found  in  a  small  programmable  logic  array  (PLA),  which  may  be  a  few  logic 
gates  or  a  few  dozen  logic  gates. 


Figure  1:  Architectural  model  for  the 
demonstration  processor. 
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We  consider  the  use  of  smart  pixels  as  the  basis  for  a  reconfigurable  processor,  in  which  a  static  free-space 
interconnect  (a  perfect  shuffle  for  this  example)  is  customized  to  perform  specific  functions.  Figure  2  illus¬ 
trates  a  simple  version  of  this  scenario,  in  which  the  interconnects  among  full  adders  in  a  conventional 
ripple-carry  adder  are  forced  into  a  single  stage  of  a  perfect  shuffle. 

We  know  from  permutation  theory  that  a  set  of  N  inputs  can  be  arbitrarily  permuted  at  the  outputs  of  a 
shuffle-exchange  network  of  depth  SlogjAM.  In  Figure  2b,  N=16  and  so  the  depth  of  such  a  permutation 
network  for  the  general  case  is  11.  The  example  shown  in  Figure  2c  is  significant  because  it  shows  that  the 
interconnects  for  at  least  one  circuit  can  be  forced  into  a  single  stage  of  a  perfect  shuffle  structure,  rather 
than  the  theoretical  upper  bound  of  SlogjA-l,  and  so  no  depth  penalty  is  incurred  with  this  approach.  We 
were  concerned  that  this  mapping  might  only  have  been  possible  because  the  example  is  so  small,  and  so  we 
doubled  the  width  of  the  adder,  and  found  a  mapping  that  still  required  only  a  single  perfect  shuffle  stage. 
Two  irregularly  structured  mappings  w ere  also  investigated,  involving  the  interconnection  of  PL  As  for  a  12- 
bit  section  cany  lookahead  (SCLA)  adder,  and  a  16-bit  SCLA.  The  interconnects  are  irregular,  die  PLAs 
have  different  sizes  as  measured  by  the  varying  numbers  of  inputs  and  outputs,  and  there  is  fan-out  within 
the  interconnect  These  mappings  were  also  successfully  made  using  only  a  single  stage  of  a  perfect  shuffle. 

In  summary,  we  have  an  opportunity  for  reconfiguring  a  static  free-space  interconnect  to  implement  a  vari¬ 
ety  of  circuits  simply  by  selectively  enabling  connections  and  appropriately  labeling  the  PLAs.  Little  or  no 
depth  penalty  is  incurred  despite  the  forced  regularity. 
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Figure  2:  Interconnects  of 
a  4-bit  ripple-carry  adder 
(a)  are  mapped  onto  a 
single  perfect  shuffle.  Ge¬ 
neric  interconnection  to¬ 
pology  (b);  and  customized 
interconnect  for  the  adder 
(c). 
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A  new  concept  called  optical  array  logic  network  architecture  (OAL-NA)  is  proposed 
for  effective  construction  of  optoelectronic  hybrid  computing  system.  With  the  help 
of  optical  array  logic  (OAL),  not  only  data  communication  but  also  global  data 
processing  are  implemented  in  the  interconnection  network. 

I.  Introduction 

Digital  optical  computing  is  one  of  promising  schemes  for  massively  parallel 
computation  in  near  future  because  of  the  common  foundation  with  the  current 
computer  science.  According  to  the  concept,  a  lot  of  techniques  including  optical 
array  logic  (OAL)1)  have  been  proposed.  However,  quantitative  estimation  on 
processing  capability  suggests  that  a  processing  style  in  which  all  operations  are 
executed  by  optical  methods  is  inefficient  and  difficult  to  achieve  processing 
capability  comparable  to  electronic  computing  systems.  Current  electronic 
computers  offer  tremendous  computational  power,  so  that  massive  parallelism  and 
ultra  fast  operation  must  be  exploited  in  optical  computing  systems  to  overcome  the 
electronic  ones.  Considering  current  status  of  researches  on  optical  computing,  we 
conclude  that  at  least  in  near  future  optoelectronic  hybrid  optical  computing  systems 
are  promising  and  practical.  However,  even  for  hybrid  optical  computing  system, 
sophisticated  architecture  is  required  to  fully  utilize  capabilities  of  optical  computing 
techniques.  In  this  paper,  we  propose  a  concept  of  optoelectronic  hybrid  computing 
system,  in  which  OAL  is  used  for  high  functional  interconnection  network. 

II.  Concept  of  OAL-NA 

Optical  array  logic  network  architecture  (OAL-NA)  is  a  conceptual 
architecture  of  optoelectronic  hybrid  computing  system.  Essentially,  multiple 
electronic  processors  execute  arithmetic  and  logical  operations  in  parallel  and  they 
communicates  each  other  through  optical  interconnection  network.  The  most 
important  feature  of  the  OAL-NA  is  that  the  optical  interconnection  network  is 
implemented  by  OAL  processor  and  provides  high  functionality:  data 
communication,  data  search,  data  test,  command  delivery,  and  so  on. 

Figure  1  shows  a  schematic  diagram  of  an  example  system  of  OAL-NA.  In  the 
system,  multiple  processing  elements  (PEs)  with  local  memories  are  located  on  one 
plane  (PE  array)  and  a  set  of  a  global  sharing  memory  and  input/output  processors 
are  arranged  onto  another  plane  (MIO  array).  The  two  arrays  are  connected  by  an 
OAL  processor  with  symmetric  data  flow.  Export  signals  from  one  array  are  used  as 
inputs  of  the  following  OAL  processor  and  the  output  of  the  OAL  processor  is 
transferred  to  the  other  array.  Electronic  modules  on  the  array  complete  encoding 
required  in  OAL  and  generate  a  coded  image  by  composition  of  individual  elements 
of  the  modules.  Once  the  coded  image  is  generated,  the  export  signals  from  the 
electronic  modules  are  processed  by  OAL.  For  the  reversal  data  transfer,  the  same 
procedure  is  adopted.  For  inter-PE  communication,  round-trip  routing  is 
established.  Specifying  the  OAL  processor,  we  can  achieve  versatile  functions. 
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Fig.  1  Schematic  diagram  of  OAL-NA 


Fig.  2  Example  operations  in  OAL-NA 


ill.  Advantages  of  OAL-NA 

OAL-NA  has  various  advantages  with  respect  to  system  construction  and 
system  capability.  Use  of  highly  developed  microprocessor  and  smart  pixel 
technologies  for  PEs  is  practical  and  matches  with  current  trend  of  massively  parallel 
architecture  in  computer  science.  It  has  been  pointed  out  that  optical  interconnection 
in  interchip  level  is  advantageous  over  electronic  one.  In  addition,  merging 
interconnection  with  processing,  the  key  idea  of  OAL-NA,  is  expected  to  provide 
more  effective  communication  and  system  operation  than  conventional  simple 
optical  interconnection.  OAL  implements  single  instruction  stream  multiple  data 
stream  (SIMD)  type  of  parallel  processing,  which  is  suitable  for  global  search  and  test 
on  the  export  data  from  PEs,  memory,  and  input/output  processors.  In  terms  of 
processing  complexity,  global  search  and  test  are  appropriate  for  OAL  scheme  and 
the  advantages  of  OAL,  i.e.,  large  processing  capability  and  flexibility,  can  be 
effectively  utilized.  Although  an  encoding  device  must  be  prepared  even  for  a 
simple  OAL  processor,  the  encoding  process  can  be  combined  into  each  electronic 
modules  and  no  additional  device  is  required  in  OAL-NA. 


IV.  Operations  on  OAL-NA 

OAL-NA  provides  various  operations  in  the  interconnection  level.  Typical 
operations  are  1)  data  transfer  (parallel  data  shift,  broadcasting,  exchange,  token 
propagation2) ),  2)  data  test  (conditional  search,  validity  check),  3)  data  processing 
(differentiation,  logical  AND/OR  for  same/neighbor  PE’s),  and  so  on.  These 
operations  are  achieved  by  programming  of  OAL.  Figure  2  shows  the  examples  and 
corresponding  OAL  programs.  Accumulated  resource  in  OAL  research  can  be  fully 
utilized  for  OAL-NA. 


V.  Summary 

In  this  paper,  we  have  proposed  a  new  concept  called  optical  array  logic 
network  architecture  for  effective  construction  of  optoelectronic  hybrid  computing 
system.  With  the  help  of  optical  array  logic,  not  only  data  communication  but  also 
global  data  processing  are  implemented  in  the  interconnection  network.  As  a  result, 
effective  data  communication  and  system  operation  can  be  expected. 
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We  describe  the  design,  fabrication  and  testing  of  an  optical  interconnect  system  constructed 
from  a  two  dimensional  array  of  symmetric  self-electro-optic  effect  devices  and  a  free  space 
perfect  shuffle  module. 


The  perfect  shuffle  has  been  proposed  as  an  efficient  interconnect  that  optics  can  do  well[l]. 
However,  to  our  knowledge,  no  free  space  optical  system  has  been  constructed  that  tests  the 
operation  of  a  shuffle  between  optical  logic  elements.  Most  previous  work  has  focused  on  the 
development  of  the  perfect  shuffle  optics  without  regard  to  interfacing  to  the  optical  logic 
elements. 

An  optical  system  designed  to  be  used  to  implement  sequential  logical  processing  has  been 
fabricated  and  used  to  demonstrate  transfer  of  optically  encoded  d»t«  from  an  optical  logic  array 
though  a  perfect  shuffle  interconnect  and  bade  onto  the  logic  array.  An  array  of  self-electrooptic 
effect  devices  (S-SEEDs)  [2]  serve  as  the  logic  array  and  the  perfect  shuffle  is  similar  to  that 
developed  at  Heriot-Watt  University  [3]. 

The  basic  architecture  is  shown  in  Figure  1.  The  input,  output  and  power  supply  optics  are  not 
shown.  The  primary  optical  circuit  is  implemented  in  a  loop.  The  S-SEED  array  is  divided  up 
so  that  each  column  corresponds  to  one  stage  of  the  interconnect.  The  output  of  the  S-SEED 
array  is  routed  through  a  one-dimensional  perfect  shuffle  module  and  then  imaged  back  onto 
the  S-SEED  array  displaced  by  one  column.  This  produces  a  multistage  interconnect  with 
perfect  shuffles  between  each  stage. 

SEED  array 


The  S-SEED  array  is  arranged  to  so  that  it  is  used  as  8  columns  of  16  devices  each.  This  size 
was  selected  to  match  the  mid  of  a  commercial  7.78  mm  laser  objective  used  to  image  the  S- 
SEED.  Spots  to  read  out  the  array  are  generated  by  two  lasers,  one  for  the  even  columns  and 
one  for  the  odd  columns.  The  necessary  spot  pattern  is  generated  by  a  binary  phase  diffractive 
element 

The  perfect  shuffle  employs  the  basic  split-and-shifr  approach  to  implement  a  theoretically 
lossless  interconnect  [3].  Its  other  major  advantages  are  that  it  can  be  constructed  from  readily 
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available  components,  it  could  be  aligned  in  stages,  it  has  identical  loss  (theoretically  zero)  for 
all  the  beams  through  the  system,  it  could  be  enhanced  to  provide  internal  fanout,  and  would 
work  with  data  in  differential  representation.  Another  important  advantage  is  that  it  could  be 
to  maintain  telecentric  imaging  through  the  shuffle  and  anamorphic  optics. 

The  cascaded  input  of  data  is  provided  by  an  electrically  addressed  linear  array  of  vertical  cavity 
surface  emitting  lasers  (VCSEL).  It  is  used  to  set  the  first  column  of  S- SEEDs.  The  VCSEL 
array  is  directly  driven  with  an  open-collector  TTL  circuit  that  is  also  used  to  sequence  the 
column  and  programming  lasers. 

Figure  2  shows  a  schematic  of  the  basic  system.  Light  from  the  even  and  old  column  laser  is 
combined  and  then  the  array  of  spots  is  generated  These  spots  are  then  directed  through  a 
beam  combining  system  and  focused  onto  the  S-SEED  array.  The  reflected  light  from  the  S- 
SEED  is  shuffled  and  then  has  the  spot  pitch  corrected  by  an  anamorphic  telescope  before 
being  combined  and  imaged  onto  the  S-SEED  array.  The  entire  system  is  doubly  telecentric  for 
improved  alignment  tolerance,  except  at  the  intermediate  focus  of  the  anamorphic  telescope. 
The  system  was  constructed  using  the  milled  baseplate  approach  [4]. 
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Figure  2  Block  diagram  of  the  cascaded  system. 


The  system  has  been  assembled  ami  aligned  and  demonstrated  cascaded  data  transfer  at 
10  kHz.  The  design  and  alignment  process  have  uncovered  issues  relating  to  cascading  of 
devices,  operation  of  die  perfect  shuffle  and  free  space  optical  system  design. 

In  conclusion,  we  have  demonstrated  the  first,  to  our  knowledge,  free  space  optical  system  that 
cascades  light  through  a  perfect  shuffle.  This  system  was  constructed  using  bulk  optics  and 
employed  a  16x8  array  of  S-SEEDs. 
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Abstract 

We  present  a  novel  technique  for  parallel  optical  data  transcription  and  digital  logic  with  high 
speed  differential  pairs  of  optical  thyristors  and  demonstrate  these  vital  operations  with 
compact  optical  hardware  circuitry. 

Summary 


Among  the  very  promising  elements  for  digital  parallel  optical  information 
processing  are  differential  pairs  of  PnpN  optical  thyristors.  It  has  been  shown  very  recently 
that  these  optical  detector-regenerator  type  of  devices  can  be  switched-on  in  very  short  times 
(typical  10  ns)  with  very  low  switching  energies  (typical  40fJ  of  absorbed  optical  energy), 
while  the  problem  of  slow  switch-off  times,  causing  unpractical  cycle  times,  was  tackled 
using  a  double-heterojunction  structure  that  can  be  completely  depleted  by  means  of  a  simple 
negative  anodc-to-cathode  low  voltage  pulse.  In  this  way  PnpN  thyristors  with  switch-off 
times  of  10  ns  became  available  [1]. 

First ,  we  will  propose  in  this  paper  a  novel  approach  to  perform  two  vital  functions  in 
digital  optical  computing  using  these  high  speed  differential  pairs  of  optical  thyristors, 
namely  optical  data  transcription  and  digital  optical  logic.  We  therefore  use  a  differential 
pair  with  a  particular  electrode  configuration  such  that  each  thyristor  of  the  differential  pair 
(i.e.  Ta  and  Tb)  has  two  topelectrodes  (see  Figl).  This  makes  it  possible  to  control 
electronically  the  position  of  the  optical  output  of  the  activated  thyristor.  For  example,  if 
thyristor  Ta  wins  tire  competition  with  thyristor  Tb.  the  two  topelectrodes  of  Ta  allow  for 
three  different  light  output  combinations,  depending  on  the  voltages  applied  (see  Fig2).  Our 
conventions  to  represent  the  logic  1  and  a  logic  0  states  are  depicted  in  Fig3.  Moreover,  these 
topelectrodes  can  be  normally-connected  or  cross-connected.  We  show  that  cascading  one 
cross-connected  and  two  normally-connected  differential  pairs  of  optical  thyristors  in  the 
proper  sequence,  and  providing  them  with  the  correct  and  synchronized  voltage  sequences 
allows  the  performance  of  AND,  OR,  NAND  and  NOR  logic  functions. 

Next,  we  will  present  an  optical  hardware  circuit,  for  cascading  and  interconnecting 
these  differential  pairs,  such  that  both  digital  optical  logic  and  optical  data  transcription 
become  possible.  The  optical  hardware  consists  of  quarter  pitch  GRIN  lenses  and  cube 
splitters;  components  that  can  be  easily  packaged  with  the  thyristor  chips  into  very  compact 
and  stackable  optical  processing  modules. 

Finally,  we  show  videotaped  experimental  results  of  optical  data  transcription  and 
digital  optical  logic,  and  we  discuss  operation  speed  as  a  function  of  the  optical  energy 
transfer  efficiency  of  the  optical  hardware  circuitry. 
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Fig.  2  :  Joint  and  separate  turn-on  of  thyristor  Ta 


Reference:  [1]  P.  Heremans,  M.  Kuijk,  D.A.  Suda,  R.  Vounckx,  R.E.  Hayes  and  G. 
Borghs,  "Fast  turn-off  of  two-terminal  double  heterojunction  optical  thyristors",  Appl. 
Phys.  Lett.,  61(11),  pp.  1326-1328, 1992. 
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An  associative  memory  is  of  primary  importance  in  data  flow  digital  computers  [1],  Such  a 
memory  searches  by  special  code  (key)  coming  out  of  processing  unit  (PU)  for  a  data  set  to  be 
transferred  to  the  next  operation  for  an  execution.  A  high-speed,  high-capacity  optoelectronic 
associative  memory  (OAM)  implemented  of  K  one-port  OAM  modules,  where  K  is  the  number 
of  PUs  [2,3]  requires  using  in  data  flow  computing  architecture  the  high-performance 
reconfigurable  rearrangeable  nonblocking  interconnection  networks  of  size  KxK.  Its  realization 
is  highly  conjectural  nowadays. 

In  this  report,  the  new  optical  setups  of  multiport  OAM  are  suggested.  The  OAM  enables 
M  users  to  execute  a  simultaneous  and  independent  parallel  associative  data  search  and 
retrieval  into  memory  N  stored  words  by  M  search  arguments,  as  well  as  a  random-access 
writing  of  keys  and  data.  The  exploitation  of  such  a  multiport  associative  memory  in  data  flow 
digital  computers  is  illustrated  by  the  block  diagram  in  Fig.  1 .  The  circuit  diagram  that  explains 
the  principle  of  the  optoelectronic  multiport  memory  with  the  parallel  optical  associative  M  key 
search  for  M=4,  L=4,  and  N=16,  as  examples,  is  shown  in  Fig.  2.  Spherical  lenses  are 
employed  in  the  setup  only.  The  hybrid  memory  board  (HMB)  is  a  2-D  memory  cell  array, 
where  each  memory  cell  consists  of  the  light  modulator  and  MOS-transistor-based  trigger 
placed  near  it  and  electrically  connected  to  the  light  modulator  and  with  key  information  stored 
into  electronic  semiconductor  memory.  The  address  writing  of  keywords  in  the  HMB, 
encoding  and  decoding  of  memory  cell  physical  addresses  and  data  associated  keywords  is 
accomplished  by  the  electronic  devices  that  include  in  the  semiconductor  memory.  The  stored 
keywords  are  registered  as  transparent  and  nontransparent  areas  corresponding  to  1  or  0  of  the 
binary  bit  b^  The  functions  of  masked  search  argument  registers  are  accomplished  by  the  light 
source  arrays.  The  availability  or  absence  of  light  beam  corresponds  to  1  or  0  in  the  binary  bit 
alm  of  search  argument.  The  physical  coincidence  addresses  are  fixed  by  photodetector 
arrays  which  serve  as  threshold  optical  invertor  arrays.  The  OAM  scheme  represented  in  Fig.  2 
ensure  an  associative  search  in  the  HMB  in  light  transmission  mode.  However,  if  light 
polarization  plane  modulators  are  employed  for  the  HMB  it  can  be  modified  to  operate 
reflected  light  by  using  polarizing  beam  splitter  cube  placed  before  the  lens  of  the  hybrid 
memory  board. 

The  data  search  process  for  such  an  OAM  consists  of  the  following.  The  binary  bits  alm, 
ajm  aun  of  all  (m  ~  1,2.  ...  M)  search  arguments  produced  are  encoded  by  a  dual-rail  code 
and  binary  bits  blw  b„  ...  b^  of  all  ( n  =  1,2,  ...  N)  keywords  are  stored  in  an  inverse  dual-rail 
code.  The  optical  system  projects  each  search  argument  image  onto  the  keys  stored  on  the 
HMB  and  the  result  of  coincidences  (i.e.  ight  absences)  are  fixed  by  threshold  optical  invertor 
arrays.  As  seen  in  Fig.  2  the  optical  invertor  with  the  numbers  m  and  n  fixes  the  fact  of 
coincidence  of  the  mth  search  argument  with  the  nth  key. 

In  the  report,  a  same  variants  of  the  optical  scheme  for  the  multiport  optoelectronic 
memory  with  associative  data  search  are  suggested.  The  method  for  enhancing  the  memory 
size  and  write  rate  based  on  combining  the  K  multiport  OAM  modules  into  the  whole  system  is 
considered.  It  is  shown  that  such  a  mulimoduie  OAM  can  provide  a  simultaneous  and 
independent  M  key  search  for  the  number  of  ports  M=100  with  an  associative  data  access  time 
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of  ~10  ns  and  overall  peak  write  and  readout  rate  of  10'°  words  into  memory  of  overall 
capacity  of  up  to  KN  =  10M0*  words  with  50-bit  dual-rail  coded  keys  The  possibility  of 
creating  the  multiport  OAM  is  discussed. 

1.  Burtsev  V.S.,  Optical  Computing  &  Processing,  1, 1991,  no.  4,  p.  269. 

2.  Burtsev  V.S.  and  Fyodorov  V.B.,  Optical  Computing  &  Processing,  1,  1991,  no.  4,  p.  275. 

3.  Burtsev  V.S.  and  Fyodorov  V  B.,  Optical  Computing  &  Processing,  2,  1992,  no.  3,  p.  161. 
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Fig.  1 .  A  simplified  block  diagram  of  the  data  flow  computer  based 
on  the  multiport  associative  memory  :  PU-processing  unit,  I/O-input 
and  output,  MMR-multiple-march  resolver  and  encoder. 
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Fig.  2.  The  optical  scheme  of  the  optoelectronic  multiport  memory  with  parallel  associative  M 
key  search:  is  the  /th  binary  bit  (/=  1,2, ...  L)  of  the  mth  search  argument  word  (m=l,2,  .  .. 
M),  m  is  the  Ah  binary  bit  of  the  nth  stored  keyword  (n=l,2, ...  N),  is  the  physical  address 
of  the  coincidence  of  the  mth  search  argument  with  the  nth  keyword. 
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Abstract 

We  present  a  very  simple  optoelectronic  analog  retina  which  we  constructed,  using  t  CMOS 
technology.  The  experimental  results  indicate  that  the  system  is  capable  of  acquiring  optical  data 
of  in  image  and  detecting  the  edges. 

Summary 

Farh  basic  ceO  in  this  retina  contains  only  21  transistor*  (surface :  4.9  mm3).  When  connected 
to  the  photodiodes  It  Is  able  to  do  the  data  acquisition,  optical-electric  signal  transformation. 
calculation  for  edge  detection  and  (fata  drifting.  The  retina  it  expected  to  be  fanned  by  128*64  such 
cells. 

The  algorithm  lmpiarn^  in  fee  circuit  is  fused  on  the  "Roberts*  differential  detecdon(Flg.l-a 
tod  Hg.1-6).  Compared  to  the  ’classic  Robots”  detection  four  supplemental  operators  are 
introduced  into  the  template  inpulse  response  arrays  in  order  to  improve  the  sensitivity  of  edge 
orientation  (  S3.  S4,  SS,  S6  on  Hg.l-a).  The  compUaity  of  the  dreoit  for  such  in  improvement 
remains  the  same. 

013  EH  BE  EH  IEB  EH  EES 

Fig.  1  -a  Hm  H$i  Has  Hh 

Hgi'Hg]  :  classic  RobeTO  differentiators  Hj!  HSJ-  :  supplemental  differenditfon 


F(aui)  :  original  image  Oi(mji) :  graduot  in  the  /  tfa  direction 

Hilnui) :  spatial  diflreotutar  G  (mji) :  gradient  after  the  campus 

EM  :  edge  map 

Fig.l-b  General  procedure  of  the  "Roberts"  detection 

Fig  2  shows  the  results  of  the  "KHOROS"  simulation  for  die  detection  algorithm. 


(a)  Original  image 

(b)  Edge  map  of  the  “classic  Roberts" 

(c)  Edge  map  attained  by  using  the  first  four ", Roberts  “  operators.  Some  diagonal  lines  missing  on 
Ftg2-b  show  up  here. 

(d) Edge  map  attained  by  using  all  the  six  "Roberts"  operators.  Then  aren't  any  significant  differences 
between  Fig  3-c  and  Fig  3-d 

In  fact  the  operation  of  the  "Roberts"  spatial  convolution  is  the  combination  of  subtractions  of 
the  four  neighboring  pixels.  If  the  hminoos  signal  varies  slowly  compared  to  the  speed  of  die 
treatment,  we  can  decompose  the  calculation  in  terms  of  time  to  eliminate,  two  operations : 

1/  absolute  value,  since  the  operaion  of  I  IvFj  i  can  be  replaced  by"  max  {(Fi-FjXOfyFi))"* 
2/  max  value,  this  operation  is  followed  by  a  comparison  with  a  fixed  threshold,  we  can 
replace  the  two  operations  by  comparing  all  the  possible  input  data  to  the  threshold 
successively.  As  long  as  die  result  of  the  comparison  is  true,  T  is  writen  to  a  memory. 
So,  the  general  procedure  (Fig.  l*b)  can  be  simplified  as  the  iterated  sub-procedures  with  only 
subtraction,  comparison  and  writing  ”1"  in  die  memory  (Flg.3).  In  this  case  the  acquisition  should  be 
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sit?  Iterated  llff  ffr**Mfy  ™«nh»narirtf»«  nf  the  tlgmlt  Radi  actpilsitkm  it  immflfH  «triv 

followed  by  foe  sub-procedure.  The  procedure  of  the  detection  for  each  cdl  U  shown  io  Flg.3. 

The  calculattnn  becomes  "parallel  and  sequential".  The  processing  time  it  a  bit  longer  than  the 
"tool  parallel",  but  the  lmplantaloo  Is  much  simpler.  A  compromise  between  the  speed  tod  the 
simplicity  Is  mched. 

y :  aBtarafkapiullate 


FigJ  Simplified  procedure  of  the  "Roberts  ’detection  for  the  "parallel  aod  squendal’  implantation. 
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Fig.4  Pixel  array  FigJ  Scheme  of  a  basic  cell 

Flg.4  shows  the  spatial  arrangement  of  the  pixels  and  Fig.5  is  the  scheme  of  a  basic  cell  with 
the  four  neighboring  photodiodes. 

The  left  part  of  foe  cell  converts  the  difference  between  incident  intensities  (L1-L2,  for 
instance)  into  the  voltage  on  the  capacity  C.  By  turning  the  four  switches  (Ki-  K4)  on  or  off.  we  can 
choose  the  data  to  be  calculated.  The  control  signal  "elk"  drives  the  transistors  TA  and  T***  in  order 
for  the  capacity  C  to  be  charged  (by  means  of  the  current  mirror  to  change  the  sign  of  the 
photocurrent)  then  tfischarged  by  the  photocunent  Thus  the  voltage  Ve  is  foe  function  of  the 
difference  of  foe  tncMe«*  light  By  choosing  foe  value  of  Vg,  foe  grid  voltage  of  we  can  stabilize 

the  bias  of  the  photodiodes,  therefore  improving  foe  linearity  of  the  transformation. 

The  right  part  of  Fig. 5  performs  the  comparison,  memorisation  of  "1"  (ignore  "0"),  and  data 
shifting  to  the  output  The  value  of  the  threshold  is  related  to  foe  sizes  of  the  transistor*  and  the 

inverter**.  _ _ 

M( - "err.— -j - 

„  A  prototype  of  foe  cell  with  four 

l  photodiodes  is  produced  in  ES2 1,5  urn 

|  process.  The  expected  test  results :  the 

characteristics  of  the  left  part  of  Flg.5  is 
I  §3ti::§ijiSE  shown  on  Flg.6. 

^  p _ _ _ _ J.  Fig.  6  Test  results  of  the  linearity  of  the 

“n— ......  .. AL/Vc  transformation 

Because  of  foe  dynamic  configuration  of  foe  pixel  array,  for  each  cdl,  the  data  carried 
by  incident  light  on  foe  four  neighbouring  photodiodes  are  accessible  and  selectable.  All  the 
basic  calculations  for  the  detection  are  performed  by  the  cells,  yet  the  order  of  these 
calculations  can  be  simply  rearranged  by  modifying  the  sequence  of  the  control  signals.  So  both 
"classic  Roberts"  and  "modified  Roberts"  can  be  implemented  in  foe  same  circuit.  This  allows 
the  future  users  to  have  more  flexibility  to  meet  different  needs. 

We  are  still  trying  to  improve  foe  performance  of  the  circuit,  especially  the  linearity  of 
the  opdcal-dectric  transformation,  (tee  of  foe  improvements  in  foe  fixture  version  of  foe  circuit 
is  tte  compensation  for  foe  imperfection  of  the  current  mirror,  a  critical  part  of  the  circuit.  This 
will  be  dime  by  means  of  an  adaptive  control,  for  example,  a  charge  transfer  or/and  an  analog 
memory. 
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Abstract 

A  complete  statistical  analysis  and  modeling  of  a  generic  three-plane  optical  processor  is  preeented.  Output 
**g— 1  are  fnr  a  number  rf  in«#»Mtmg  yriri  cmm,  «mt  *»»  ftwAmwifl  theoretical  accuracy 

limitations  are  established. 
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Summary 

The  Boat  important  and  prominent  strengths  of  optical  processors  are  their  speed  and  parallelism.  However, 
they  also  suffer  from  some  fundamental  drawbacks,  the  most  critical  being  low  computational  accuracy,  that  limit 
their  application  to  real-world  problems. 

The  node  sources  that  determine,  and  bound,  the  accuracy  of  an  optical  processor  fall  into  two  groups.  On 
one  hand,  we  have  the  system  noiae  sources,  such  as  diffraction,  crosstalk,  and  background  radiation,  that  ariae  from 
the  processor  architecture.  On  the  other  hand,  we  have  the  device  noise  sources  that  introduce  inevitable  inaccuracies 
aaodaied  with  the  physical  representation  and  interpretation  of  die  information- bearing  signals  in  the  system.  Of 
parrirnlar  significance  are  the  unite  in  the  input  Hght  intensities  dan  to  phnmn,  earimtinn,  and  emwwni  fluctuations, 
the  randomness  in  the  SLM  intensity  transmittance*  due  to  transmission  aod  polarization  fluctuations,  and  the  noise 
in  photodetoction  due  >o  nonunity  quantum  efficiencies,  gam  fluctuations,  shot  noiie.  dark  current,  and  thermal  noise. 
Given  the  statistical  description  of  these  processes,  one  cn  then  utilize  tools  from  statistics!  optics  to  establish  the 
statistics  of  the  system  output  signaL1*  Baaed  on  these  results,  optimal  detection-  and  estimation-theoretic  techniques 
can  be  brought  to  bear  on  the  problem  in  an  effort  to  improve  the  accuracy  of  the  processor,  as  we  demonstrated 
earlier  in  proof-of-princqde  simulations. u 

Toward  this  end,  we  first  carried  out  a  general  statistical  analysis  of  a  Stanford-type  optical  matrix-vector 
multiplier  under  certain  simplifying  assumptions.1  establishing  the  probability  density  and  mutual  coherence  functions 
of  the  detected  field  intensity  and  the  photodetector  output  cunent  in  tenne  of  the  «»— «tir»i  characteristics  of  the 
syatem  components.  We  then  passed  to  a  mare  general  and  rigorous  study  of  a  generic  three-plane  processor,  dins 
expending  our  scope  to  a  wider  class  of  systems  including  optical  correlators,  optical  interconnects,  and  optical  linear 
algebra  processors.** 

We  next  turned  to  evaluating  these  expressions  Cor  particular  cases  of  interest,  obtaining  system  output 
statistics  for  various  combinstions  of  sources,  modulators,  and  detectors.*  Specifically,  we  considered  classical  sources 
such  as  laser  diodes  and  LEDs,  and  popular  photodetectori  such  as  p-i-n  and  avalanche  photodiodes.  Since  statistical 
models  for  moat  SLMs  are  currently  unavailable,  we  considered  hypothetical  models  such  as  a  neutral  device  with 
no  tranaatinance  fluctuations  and  a  Gnutaian  random  ampfitudafrhaae  screen.  Brao-apaoe  propagation  was  assumed, 
and  Presnel-regime  diffraction  at  well  as  die  geometrical -optica  limit  were  considered. 

In  this  paper,  after  briefly  reviewing  these  preliminaries,  we  proceed  lo  establish  the  overall  system  output 
statistics  for  a  number  of  interesting  device  combinations.  We  then  provide  a  quantitative  definition  of  accuracy  for 
analog  optical  processors,  first  from  a  detection-theory  perspective  and  then  from  an  estimation-theory  perspective. 
With  the  availability  of  the  output  signal  probability  density  function,  die  maximum  achievable  accuracy  Cor  each 
special  case  is  then  found  as  a  function  of  the  system  parameters  involved.  We  also  present  theoretical  results  an 
the  attributes  of  qpeed  and  parallelism,  which  are  intimately  related  to  accuracy.  Finally,  future  work  on  accuracy 
enhancement  is  suggested. 
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Abstract 

A  fiber  optic  tapped  delay  line  correlator  based  on  time-integration  is  proposed  and 
demonstrated.  Such  a  correlator  will  combine  high  bandwidth,  large  processing  gain  and 
ability  to  compute  real-time  autocorrelation  functions. 

Summary 

Fiber  optic  delay  lines  have  been  suggested  for  use  in  tapped  delay  line  architectures  for 
performing  cross-correlation  operations^  .3.  in  those  systems  fibers  of  different  length 
implement  the  delay  lines  and  a  spatialXlight  modulator  encoded  the  tap  weights.  Although 
attractive  due  to  the  potential  for  high  bandwidth  operation,  this  system  suffers  from  two  major 
limitations.  The  number  of  taps  limits  the  signal  length  and  therefore  the  correlation  gain. 
Since  each  tap  introduces  signal  loss,  the  total  number  of  taps  will  be  limited  to  a  few  hundred. 
Additionally,  the  system  shown  cannot  implement  real-time  cross-correlations  between  two 
arbitrary  signals.  Similarly,  calculation  of  the  autocorrelation  of  a  signal  requires  temporary 
storage  of  the  signal  and  high  bandwidth  updating  of  tap  weights. 

Both  of  these  limitations  can  be  overcome  by  using  an  alternative  design  based  on  time- 
integration.  Figure  1  shows  a  diagram  of  a  fiber  optic  tapped  delay  line  time-integrating 
correlator.  The  input  signal  s(t)  directly  modulates  a  laser  diode.  The  modulated  optical 
signal  is  split  by  a  1-to-N  fiber  coupler  and  a  relative  delay  is  introduced  into  each  of  the  N 
channels  by  adjusting  the  length  of  the  fibers.  The  second  input  signal  r(t)  is  applied  to  a  high 
bandwidth  optical  modulator.  The  time-delayed  representations  of  s(t)  are  modulated  by  r(t) 
and  imaged  onto  a  1-D  time-integrating  detector  array.  The  output  of  the  n*b  detector  at  the  end 
of  the  time-integration  will  correspond  to: 

c(n)  =  J  s(t-Tn)  r(t)  dt 

where  Tn  is  the  relative  time-delay  in  the  n®  fiber  tap  and  the  integration  is  performed  ovst 
time  t.  The  correlation  gain  is  determined  by  the  integration  time  of  the  detector  array  and  is 
independent  of  the  number  of  taps.  The  number  of  taps  determines  the  number  of  points 
computed  in  the  correlation  function.  Therefore,  the  time-integrating  architecture  can 
accommodate  a  longer  signal  length  while  limiting  the  number  of  points  in  the  cross¬ 
correlation  function  to  the  number  of  taps.  If  signals  s(t)  and  r(t)  are  identical,  autocorrelation 
results. 

This  architecture  was  experimentally  demonstrated  in  an  8-channel  correlator.  A  TeC>2 
acoustooptic  modulator  A  100  MHz  analog  LED  at  860  nm  provided  the  light  input  The  output 
was  detected  on  a  CCD  camera  and  digitized.  Although  the  AO  device  has  a  50  MHz  bandwidth, 
the  fiber  outputs  were  focused  to  approximately  100  micron  spot  in  the  device  leading  to  a  transit- 
tune  limited  bandwidth  of  only  5  MHz.  The  schematic  diagram  of  the  experimental  setup  is 
shown  in  Figure  2.  The  experimental  results  obtained  on  a  sampled  autocorrelation  of  a  5  MHz 
*Qoare  wave  are  shown  in  Figure  3.  The  points  indicate  theoretical  values  and  the  continous 
line  shows  experimental  results. 
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Figure  2:  Schematic  diagram  of  the  experimental  set  up 
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ABSTRACT: 

We  optically  experiment  a  joint  transform  correlator  (JTC)  where  the  input  image  is 
replaced  by  a  synthetic  discriminant  function  (SDF)  filter.  This  latter  allows  us  to  ensure 
a  practical  invariance  over  a  10°  range. 


INTRODUCTION: 

Optical  correlation,  and  especially  joint  transform  correlation,  has  been  known  as  an  efficient 
alternative  to  its  electronical  counterpart  since  the  1960's  and  compact  implementations  of  joint 
transform  correlators  (JTCs)  have  recently  been  presented.  But  correlation  is  inherently  rotation- 
and  scale-  variant  and  a  traditional  correlation  operation  cannot  perform  pattern  recognition  in 
most  practical  cases.  In  order  to  overcome  this  limitation,  Javidi1  has  proposed  and  simulated  a 
JTC  using  synthetic  discriminant  function  (SDF)  filters.  In  this  paper,  we  provide  optical 
experimental  results. 


BASIC  THEORY: 

The  basic  principle  of  joint  transform  correlation  is  now  widely  known,  and  we  will  not  present 
it.  The  need  of  ensuring,  for  instance,  a  rotation  invariance  appears  clearly:  an  in-plane  rotation 
of  one  degree  causes  a  dramatic  energy  loss  of  more  than  3  dB.  To  achieve  an  invariance  for  such 
a  phenomenon,  we  decided  to  use  the  synthetic  discriminant  function  technique3,  a  general 
method  to  achieve  any  invariance  which  in  our  case  consists  in  replacing  the  reference  image  with 
several  rotated  views.  The  correlation  peak  height  will  be  equal,  provided  the  input  view  belongs 
to  the  training  set  The  point  is  to  test  intermediate-rotated  input  views,  between  those  of  the 
training  set. 

The  simplest  SDF  filter  is  called  pseudo-inverse  SDF  (PI-SDF).  Its  computation  is 
straightforward  because  it  is  only  a  linear  combination  of  views  from  the  training  set.  Other 
types  of  filters  have  then  been  derived  from  it:  the  minimum  variance  SDF3  (MV-SDF)  which  is 
noise  resistant,  the  minimum  average  correlation  energy4  SDF  (MACE-SDF)  which  sharpens  the 
correlation  peak  and  improves  discrimination  and  the  optimal  trade-off  SDF6  (OT-SDF)  which  is 
a  linear  combination  of  the  previous  two  filters. 


SIMULATIONS: 

We  tested  the  four  kinds  of  SDFs  mentioned  above,  in  order  to  verify  their  training  ability.  For 
each  type  of  filter,  we  calculated  four  filters,  each  of  them  containing  6  views,  with  a  various 
angular  step  between  two  consecutive  views:  0.5°,  1°,  2°  and  4°. 

The  simulations  give  the  same  results  as  those  in  the  literature  for  the  4f  architecture:  the  PI- 
SDF  training  ability  is  average:  there  is  a  marked  difference  between  the  images  belonging  to 
the  training  set  and  images  of  the  same  airplane,  but  not  in  the  training  set  The  discrimination 
between  the  plane  of  the  training  set  and  another  lifelike  plane  is  unambiguous  (the  ratio  of  the 
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correlation  peak  heights  is  over  4).  The  MACE-SDF  only  recognizes  the  views  it  contains  and  so 
proves  useless,  and  the  MV-SDF  produces  a  very  poor  discrimination,  sometimes  dose  to  unity. 


EXPERIMENTS: 

We  performed  experiments  on  the  CERT  Optical  Science  Department  demonstrator,  in  which 
SLMs  are  amplitude-modulating. 

We  tried  to  test  the  four  kinds  of  filters  previously  simulated.  Unfortunately,  MACE-SDFs,  MV- 
SDFs  and  so  OT-SDFs  prove  unusable:  they  are  non-positive,  so  produce  a  non-zero  background 
on  our  amplitude  SLM  and  especially  their  dynamic  range  is  high.  The  input  setup  cannot 
display  both  a  large  dynamic  range  and  tiny  details. 

We  experimented  6-view  PI-SDFs,  with  respective  angular  steps  of  0.5°,  l.°,  2.°  and  4.° 
between  consecutive  planes.  For  a  0.5,  1  or  2°  difference,  the  output  response  is  quite  uniform 
over  the  angular  range:  +/-  7  %  (see  fig  1  for  the  2°  step).  The  training  ability  is  good:  we  cannot 
even  find  any  difference  between  die  images  belonging  to  the  training  set  and  images  of  the 
same  airplane  which  are  between  those  in  the  training  set  The  discrimination  is  also  good  (the 
ratio  of  the  peak  heights  is  at  least  1:5). 

For  a  4°  difference,  the  results  deteriorate  dramatically  (fig.  2)  and  are  not  in  good  agreement 
with  the  simulations. 
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Fig.  1:  simulated  and  experimental  correla¬ 
tion  peak  heights  for  a  SDF  whose  views  dif¬ 
fer  from  2°.  Crosses  denote  autocorrelation 
when  the  input  view  belongs  to  the  training 
set 


Fig.  2:  simulated  and  experimental 
correlation  peak  heights  for  a  SDF  whose 
views  differ  from  4°.  Crosses  denote  autocor¬ 
relation  when  the  input  view  belongs  to  the 
training  set 


CONCLUSION: 

We  have  implemented  a  joint  transform  correlator  using  synthetic  discriminant  function  filters. 
The  experiments  proved  that  the  MACE-SDF  and  the  MV-SDF  -and  so  the  OT-SDF-  cannot  be 
displayed  correctly  on  an  amplitude  SLM.  But  the  PI-SDF  which  is  correctly  displayed,  works. 
We  have  shown  that  for  an  in-plane  rotation  and  provided  that  the  angular  step  between  the 
views  it  contains  is  not  too  high,  this  filter  can  lead  to  an  unambiguous  discrimination,  for  any 
object  orientation  over  the  filter  angular  range:  it  has  shown  a  certain  training  ability  and 
especially,  that  it  could  be  optically  implemented,  without  too  much  damage,  into  JTC 
architectures  whose  compact  examples  have  become  popular. 
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Abstract 

An  optimal  processor  for  discrimination  and  noise  robustness  is  developed  for  pattern 
recognition.  It  leads  to  a  natural  and  attractive  high  speed  optical  implementation.  Fur¬ 
thermore,  it  provides  theoretical  insight  in  previous  heuristic  non  linear  filtering  techniques. 

Within  the  past  decade  different  architectures  for  optical  correlators  have  been  demonstrated 
[1-3].  Although  the  spatial  matched  filter  is  optimal  for  noise  robustness,  its  limitations  such 
as  broad  correlation  peaks,  sensitivity  to  distortion  and  low  discrimination  capabilities  are  well 
known  and  different  filters  for  optical  correlation  have  been  proposed  [4-7].  On  the  other  hand, 
nonlinear  joint  transform  correlators  (JTCs)  [3],  have  shown  to  be  very  discriminant  with  good 
correlation  performance.  For  optical  correlation  characterization,  different  criteria  have  been 
proposed  [8],  and  among  them,  noise  robustness  and  sharpness  of  the  correlation  function  have 
been  shown  to  be  of  the  first  interest.  Furthermore,  the  importance  of  finding  trade-offs  between 
different  criteria  is  now  well  established  [7,8].  However,  up  to  now,  the  discrimination  capabilities 
of  the  filter  are  optimized  indirectly  by  minimizing  either  the  sharpness  of  the  correlation  function 
[7]  with  false  objects  (that  is  objects  to  be  rejected)  or  background  models  to  be  discriminated 
against  [9].  Here,  an  optimal  method  for  discrimination  capabilities  which  doesn’t  need  a  priori 
knowledge  of  the  false  objects  or  of  the  background  is  introduced. 

ID  notations  are  used  for  simplicity  with  no  loss  of  generality.  Let  r(t)  and  s(t)  denote 
respectively  the  reference  and  input  image.  The  output  of  the  optimum  processor,  C(t)  = 
h(t  +  0*3(£),  *s  the  correlation  between  s(t)  and  a  filter  function  h(t),  where  *  denotes 
complex  conjugation,  and  N  is  the  total  number  of  pixels.  When  the  input  image  is  the  reference 
object  (i.e.,  s(t)  =  r(t)),  it  is  imposed  that  the  filter  produces  the  correlation  peak  C(0)  =  Co . 

If  the  input  image  is  a  modified  or  a  distorted  version  of  the  reference  image  r(t),  one 
generally  requires  a  small  variation  in  the  correlation  peak.  This  requirement  can  be  achieved  by 
minimizing  the  mean  square  error,  which  is  equivalent  to  the  optimization  of  the  noise  robustness 
to  input  noise  [10].  In  order  to  optimize  the  discrimination  capabilities,  one  may  minimize  the 
energy  of  the  correlation  function  due  to  any  input  image  s(t):  E,[h]  =  J2k  |h(k)|2  |s(fc)|2. 

It  can  be  shown  that  the  optimal  trade-off  between  input  noise  robustness,  discrimination 
£a[h],  and  correlation  peak  sharpness  (see  [8]  for  a  mathematical  definition  of  this  criterion) 
leads  to  the  optimum  processor  (in  the  Fourier  domain): 

C(k )  =  r-(k)i(k)/[S(k)  +  (1  -  p)  |f(*)|3  +  p\m\2)  (1) 

where  S(k)  is  the  spectral  density  of  the  noise  model.  The  performances  of  this  processor  will 
be  illustrated  with  numerical  experiments  in  comparison  to  other  well  known  filtering  methods. 
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Eq.  1  is  very  similar  to  the  Fourier  transform  of  a  nonlinear  JTC  output  [3]  which  can  be 
implemented  optically.  Indeed,  both  have  the  same  Fourier  phase  and  the  amplitude  modulation 
requires  nonlinear  transformation  of  both  Fourier  magnitudes  of  the  reference  function  and  the 
input  function.  Let  us  consider  the  case  of  a  white  noise  model  (S(fc)  =  /3  which  is  constant). 
Using  a  photorefractive  crystal  in  the  Fourier  domain,  it  is  possible  to  implement  optically  the 
optimal  method  of  Eq.  1.  Indeed,  if  one  considers  a  four  wave  mixing  correlator  [11]  [12],  the 
modulation  index  of  refraction  is  proportional  to  Eq.l.  This  architecture  is  sketched  in  fig.  1. 


Figure  1:  Schematic  representation  of  a  4 
waves  mixing  correlator.  Input  image  s(t) 
is  written  on  SLM1,  the  auto-correlation 
of  the  noise  model  on  SLM2  and  the  ref¬ 
erence  on  SLM3. 


If  one  considers  the  case  for  which  peak  sharpness  in  not  optimized,  the  optimal  processor  is 
now  given  by:  C(k)  =  fm(k)s(k)/[l2  +  |i(fc)|2].  This  can  be  obtained  if  the  intensity  of  beam 
3  is  low  compared  to  beams  1  and  2  in  fig.  1.  This  case  is  very  interesting  since  beam  3  is  just 
reading  the  grating  written  by  beams  1  and  2  (i.e.  s(k)/[l2  +  |s(fc)|2]).  This  process  doesn’t 
require  interaction  in  the  crystal  and  then  can  be  done  at  higher  rates  than  the  writing  process  of 
the  grating  (which  is  limited  by  the  response  time  of  the  crystal).  This  property  is  in  agreement 
with  pattern  recognition  applications  which,  in  general,  need  many  references  for  a  recognition 
in  the  input  image. 
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Pattern  recognition  using  a  coherent  optical  correlator  has  many  advantages  including 
high  speed  and  parallel  processing.  The  main  problem  of  those  systems  is  that  usually 
they  can  not  cope  with  the  various  changes  of  the  input  function  such  as  rotation  and/or 
scale. 

We  present  a  system  which  is  invariant  to  all  three  distortions  (rotation,  scale  and 
shift)  and,  at  the  same  time,  can  complete  the  recognition  process  practically  in  real  time. 
The  overall  system  is  composed  of  two  parallel  channels  that  are  connected  together.  A 
schematic  diagram  of  the  system  is  given  in  figure  1. 

Basically  the  system  operate  as  follow:  The  input  object  is  placed  on  the  first  SLM 
(spatial  light  modulator),  from  which  it  enters  a  correlation  channel  and  a  scale  measure¬ 
ment  channel.  After  the  scale  of  the  object  is  estimated,  the  correlator  is  adapted  by 
changing  the  scale  of  the  filter  or/and  the  input  pattern. 


Figure  1:  Schematic  representation  of  the  the  overall  recognition  system 


The  scale  measurement  is  performed  at  the  Fourier  plane  where  it  can  be  easily  im¬ 
plemented  optically.  Let  /(r,  0)  be  the  input  function  and  F(p,  <f>)  its  Fourier  transform 
(in  polar  coordinates).  When  the  scale  of  the  input  is  extended  by  factor  a  the  energy 
, using  Parseval’s  theorem,  becomes: 

Ea=Io  L  \a2F(aP^)f  =  (1) 

where  Eo  is  the  energy  of  the  original  function  f(r,  9).  We  define  the  ratio  between  Ea(p), 
the  energy  in  the  spatial  frequency  interval  0  <  p  <  p,  and  the  total  energy  Ea  by: 


T(a,  p)  =  ^  =  jf ’  J I*  |a2F(ap,  *)f  pdpd<f>  = 


Eo 


(2) 


M 


£ 
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Obviously  T(a,p)  depends  on  p  only  through  the  integration  limit,  hence  we  can  derive 
the  scale  factor  a  from  the  simple  relation: 


where  po  is  the  spatial  frequency  which  gives  the  predefined  To  for  a  known  reference 
object  of  scale  oq. 

The  correlation  channel  is  based  on  an  adaptive  4-f  correlator  in  which  both  filter  and 
object  can  be  modified,  in  real  time,  on  the  SLM’s.  With  this  approach  the  system  is 
very  flexible  and  can  be  easLy  adjusted.  The  filter  used  for  the  correlator  was  specially 
designed  according  to  the  system’s  limitation.  The  main  requirements  from  the  filter 
were:full  rotation  invariance,  sharp  and  high  correlation  peak,  good  discrimination  ability 
and,  finally,  a  limited  scale  invariance.  This  final  requirement  is  essential  due  to  the  errors 
during  the  scale  measurement  procedure.  Obviously,  the  scale  tolerance  range  has  to  be 
large  enough  in  order  to  fully  compensate  for  those  errors. 

The  filter  was  designed  using  the  method  of  projection  onto  convex  sets  (POCS).  The 
special  characteristics  of  the  filter  are  achieved  by  applying  appropriate  constraints  during 
the  POCS  algorithm. 

Both  simulation  and  laboratory  experiments  were  successfully  performed.  Excellent 
results  were  obtained  with  alphanumeric  characters  as  well  as  with  gray  scale  images. 

The  adaptive  4-f  real  time  correlator  enable  us  to  use  the  system  lor  many  related 
application  such  as  iterative  filter  design,  pattern  classification  etc.  The  double  channel 
method,  in  general,  can  be  used  for  a  different  set  of  distortions  in  a  similar  way. 


WD1/219 


Systems  Partitioning  and  Placement  in  Optoelectronic  MCM  Design 


J.  Fan,  S.  H.  Lee  and  C.  K.  Cheng 
Dept  of  ECE,  University  of  California,  San  Diego 
La  Jolla,  CA  92093-0407 
(619)  534-2413 


We  ducms  the  CAD  issues  for  partitioning  between  electrical  and  optical  interconnects  and  physical  layout 
in  free-spnoe  optoelectronic  MCM  design.  The  results  of  a  design  example  are  also  presented. 


Free-space  optical  interconnections  (FSOI)  have  been  shown  to  have  speed  or  power  advantage  over 
electronic  interconnections  far  long  distance  interconnections!  1  ] .  By  introducing  FSOI  into  new  MCM  designs, 
it  is  possible  to  create  an  OptoElectronic  MCM  (OE  MCM).  However,  incorporating  optics  into  electronics 
presents  new  challenges  in  computer-aided  design,  fabrication,  and  packaging.  To  integrate  optics  jwid 
electronics  at  the  system  level,  we  need  new  partitioning  and  placement  algorithms.  In  this  paper,  we  first 
introduce  the  models  for  partitioning  and  placement.  We  have  chosen  the  system  power  dissipation  as  the  cost 
function  to  minimize  in  partitioning.  In  placement,  we  minimize  the  interconnection  distance  based  on  the 
constraints  imposed  by  the  fabrication  limit  (or  coat)  of  the  system.  The  models  are  that  applied  to  a  design 
example  to  illustrate  the  importance  of  CAD  issues. 


Figure  1.  Schtmatir  drawing  of  •  physical  model  of  OE  MCM 
package  in  reflective  configuration.  Only  a  few  of  the  many 
actual  interconnects  are  shown. 


Minor 


Figure  2.  Cross-Section  of  m  Opto- Electronic  MCM  package. 


Figure  1  shows  a  physical  model  of  an  OE 
MCM.  There  are  three  layers  in  the  packaged 
system.  The  lower  layer  consists  of  multiple 
chips  of  electronics  and  optoelectronic  devices. 
Each  chip  contains  a  number  of  switching 
elements  (SEs)  or  processing  elements  (PEs). 
The  middle  layer  consists  of  diffractive  optics 
(DOEs)  or  computer  generated  holograms 
(CGHs).  The  upper  layer  is  a  minor.  The  cross 
section  of  this  system  is  shown  in  figure  2.  The 
optical  transmitters,  e.g.  surface  emitting  lasers, 
illuminate  an  off-axis  lens  where  the  beam  is 
collimated  and  then  deflected  into  the  desired 
direction.  The  beam  then  reflects  off  a  minor  and 
is  focused  by  another  off-axis  lens  onto  a  receiver 
(detector)  on  another  chip. 

There  are  four  aspects  of  optoelectronic  system 
design  that  are  key  to  the  system  performance: 
interconnection  speed,  heat  dissipation,  chip  size 
and  MCM  size,  and  maximum  optical 
interconnect  distance.  Currently  in  OE  MCM 
design,  optical  sources  (such  as  modulators  or 
laser  diodes  and  their  drivers)  generate  heat  in 
densities  mud)  higher  than  that  of  VLSI.  Based 
on  speed,  switching  energies,  and  power  budgets, 
electrical  and  optical  technologies  can  be 
compared  and  a  break-even  line  length  cm  be 


definedjl].  For  signals  that  must  propagate  distances  longer  than  this,  optical  interconnection  technology  is 
preferred,  far  signal  propagating  distances  leu  than  this,  electronic  interconnection  technology  is  preferred.  It 
can  be  shown  that  minimum  feature  size  of  the  fabrication  technology  limits  the  maximum  interconnect 
distance  [2], 

fat  the  optoelectronic  design  process,  partitioning  is  performed:  (a)  to  divide  the  total  number  of  PEs  in  the 
system  into  fewer  number  of  chips;  and  (b)  to  choose  the  best  technologies  for  each  interconnect  required  by  tbe 
netBst  of  a  certain  computing  architecture.  After  the  partitioning  process,  placement  algorithms  will  perform 
fee  physical  assipments  of  the  PEs  within  each  chips  to  reduce  the  fabrication  requirement  of  DOEs/CGHs. 
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To  the  partitioning  of  the  OE  MCM  design  using  CAD  technology,  a  formulation  most  be 

established  in  tenns  of  the  technology  constraints.  One  choice  is  to  let  three  of  the  technology  variables 
mentioned  above  be  constrained  and  attempt  to  minimize  the  fourth.  Since  power  dissipation  of  the  system 


Random 


Algorithm 


No.  of  Chip* 

Figure  3.  Comparison  of  the  power  dimpiOcio  of 
the  between  random  partitioning  and 

algorithm  partitioning.  Over  30%  improvement  can 
be  achieved  by  applying  the  algorithm. 


u  present  a  design  problem,  the  most  natural  objective  of 

j  Random  11)6  pvtitioaians  appears  to  be  to  minimize  the  heat 

_ _ _ — - *  •  dissipation  while  satisfying  the  limits  placed  on  the  speed 

*  ■  ■  of  the  interconnects,  the  size  of  the  chip  and  OE  MCM, 

/  and  the  optical  distance. 

'  y  To  solve  the  OE  MCM  partition  problem,  we  combine 

4 . .  Algorithm  -  an  algorithm  based  on  Burfcard  heuristic[3]  and  a 

- "  modified  matching  algorithm  based  on  labeling 

-2  -  -  tedmiques[4].  The  optimized  partitioning  is  achieved  by 

_ _ _ _ )  rearranging  the  PEs  among  different  chips  to  reduce  the 

number  of  interchip  interconnections,  and  therefore  to 
1  reduce  the  power  dissipation.  Figure  3  shows  a 

No.  of  Chips  comparison  of  the  results  of  random  partitioning  with  the 

Figure  3.  Comparison  of  the  power  dissipation  of  results  Of  the  CAD  algorithm  on  an  irregular 
the  interconnects  between  random  partitioning  and  interconnection  network  example[5],  for  various  numbers 
slgorithm  partitioning.  Over  50ft  improvement  can  of  chips.  An  improvement  of  over  50%  is  achieved  in 
be  achieved  by  applying  the  algorithm.  many  of  the  cases  studied. 

3.  PE  Pin  r— ret  hi  OE  MCM 

hi  the  design  of  optoelectronic  MCM  with  tree-space  optical  interconnectian,  it  is  the  maximum 
interconnection  distance  that  should  be  minimized  We  investigated  placement  algorithms  far  optoelectronic 
systems  bused  on  iterative  matching[5].  Tbe  algorithm  starts  with  a  random  placement  of  PEs  within  chips  and 
reduces  the  maximum  interconnection  distance  through  a  fixed  number  of  iterations  by  rearranging  PEs  within 
mrimumamriahmmm  tinatimlaramnud  chips.  Figure  4  shows  the 

ST*  _  histogram  results  of  a  straight 

Z  |  I  ~  |  ■  _  forward  placement  and  placement 

”11  will  by  a  iterative  matching  algorithm 

will  l|l.  far  comparison.  The  same 

~|||_|  llll  network  example  used  in  the  last 

“(Will _  "till!  I  section  is  used  here.  More  than  a 

i  i  t  t  «  i  «  7  i  t  •  i  f  i  i  i  «  7  50%  reduction  in  the  imxwnmp 

MNMMfinio  rnmtMMmrnm—m  interconnect  Hifncp.  (or  on  the 

Bgmc  4.  Compariaan  between  two  placement  metboda.  (»)  Result  of  die  P*1 2 3 4 5 6***™*  votmne  of  the  system) 
atraight  forward  method,  (b)  Rcauk  after  applying  die  placement  algorithm.  c*®  **  achieved  when  we  applied 

TV  misimam  intMrtwwf*  ifaiwy  j«  reduced  by  mi  the  algorithms  to  the  CH>MCM 

design. 

In  this  paper,  we  discussed  optoelectronic  MCM  models  and  algorithms  far  CAD  partitioning  and 
placement.  Results  show  that  more  than  50%  reduction  in  power  consumptioo  and  the  maximum  interconnect 
distance  can  be  achieved  by  applying  these  CAD  algorithms  to  s  OE  MCM  design. 
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Abstract:  The  architecture  of  a  "universal"  photonic  backplane  is  described  The  architecture 
contains  a  large  array  of  "programmable"  smart  pixels  which  can  be  configured  into  three  states. 
By  setting  pixel  states  appropriately,  any  network  can  be  embedded  into  the  backplane. 


PCS  or  MCM 


Optical  C 


Figure  1:  (a)  PCBs  with  Optical  Extender  Card, 
(b)  Photonic  Backplane. 
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Summary:  A  photonic  backplane  consists  of  a  large  number  of  parallel  optical  channels  (10,000  to 
100,000)  spaced  a  few  hundred  microns  apart  [1][2],  Each  printed  circuit  board  contains  one  or 
more  smart  pixel  arrays,  which  access  the  photonic  backplane  through  an  "optical  extender  card", 
as  shown  in  fig.  1  [1].  This  paper  proposes 
an  attractive  photonic  backplane 
architecture  which  we  call  the 
"HyperPlane". 

The  smart  pixel  arrays  manage  access  to  the 
backplane.  A  smart  pixel  consists  of  an 
incoming  window,  and  out-going  window,  a 
latch,  two  multiplexers  and  an  address  bit 
comparator,  as  shown  in  fig.  2.  Pixels  can 
be  programmed  to  be  in  one  of  three  states, 
the  "idle",  "transmitting"  and  "receiving" 
states,  as  shown  in  fig.  3.  The  state  of  a 
pixel  can  be  changed  by  down-loading  a  bit- 
stream  from  an  associated  message- 
processor.  The  pixels  can  also  be 
programmed  to  receive  messages  for  any  destination  by 
down-loading  the  appropriate  address  bits.  The  pixels 
require  12  logic  gates  each.  The  pixels  are  arranged  into  an 
array  as  shown  in  fig.  4.  The  data  for  configuring  the  array 
is  loaded  in  bit-serially;  data  to  be  transmitted  enters  from 
the  top,  and  data  being  received  exits  from  the  bottom.  The 
smart  pixel  arrays  are  being  designed  using  AT&T's  FET- 
SEED  GaAs  technology  [3]  and  are  manufacturable  using 
existing  fabrication  technology. 

The  HyperPlane  can  embed  any  conventional 
interconnection  network  by  programming  the  pixels 
accordingly.  Optimal  embeddings  would  minimize  the 
length  of  the  longest  optical  channel  in  the  backplane. 

Optimal  embeddings  for  linear  arrays,  2D  and  3D  meshes, 
toroids,  hypercubes,  crossbars,  orthogonal  crossbars,  and 
shuffle-based  networks  have  been  identified.  Typical 
embeddings  are  shown  in  figures  5-6.  Each  box  represents 
a  node.  Each  vertical  line  represents  an  electrical  data  path 
within  a  smart  pixel  array.  Each  bold  horizontal  line  represents  an  optical  connection  which  is 
implemented  by  programming  the  pixels  at  each  rad  as  transmitters  and  receivers.  The  embeddings 
can  be  changed  in  real-time  by  down-loading  appropriate  control  bits. 


Pig.  2.  Sort  pixel  «C  wMhaddnif  noogafciaa 


The  HyperPlane  architecture  can  also  be  designed  to  exploit  transparent  long  distance  optical 
transmissions.  By  appropriately  designing  the  extender  card,  it  is  possible  to  allow  long-distance 
optical  transmissions  to  by-pass  intermediate  smart  pixel  arrays  as  they  travel  down  the  backplane 
[1].  This  "Transparent  HyperPlane"  improves  performance  by  eliminating  die  delays  associated 
with  passing  through  inter-mediate  smart  pixel  arrays.  Since  the  intermediate  smart  pixels  can  no 
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longer  transmit  or  receive  over  the  long-distance  channels  which  bypass  them,  there  may  be  a 
restriction  on  the  types  of  embeddings  possible. 


[1]  H.S.  Hinton,  Canadian  Institute  for 
Telecommunications  Research  -  Research 
Program  1993-94,  pp.  143-156, 1993. 

[2]  K.  Hamanaka,  “Optical  Bus  Inter-connection 
using  Selfoc  Lenses",  Optics  Letters,  Vol.  16,  No. 
16,  pp.  1222-1224, 1991. 


{3]  L.A.  D'Asaro  et  al,  "Batch  Fabrication  and 
Operation  of  GaAS-Alx-Ga/.xAs  Field-Effect 
Transistor  Self-Electrooptic  Effect  Device  (FET- 
SEED)  Smart  Pixel  Arrays",  IEEE  Journal 
Quantum  Electronics,  Vol.  29,  No.  2,  Feb.  1993, 
pp.  670-678. 
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Abstract  -  Ws  present  a  aew  optical  iatarcoaaectioa  set  work,  called  aa  Optical  Mehi-Meth  Hypercabe,  wUck  is 
botk  das-  aad  generation-scalable.  It  exhibits  sack  properties  at  small  diameter,  fault  toleraace,  symmetry,  coaetaat 
sods  degree,  aad  is  highly  aawaable  to  optical  impfameatations. 

Samnmry- The  meet  far  IWaiops  (1011  loatiac  point  opera  tines  per  seeoad)  sa  perrons  peter*  faeled  by  the  laanch- 
iag  of  the  High  Performance  Competing  aad  Commaaicatkm  Program  is  patting  major  emphasis  oa  exploiting  massive 
paraBsMem  with  greater  thaa  oae  thoneaad  proceeelag  demeats  (PEe)  networked  to  farm  massively  par alia  compat- 
ere(l(PCe)(l].  Aa  important  ieene  in  the  design  of  If  PCs  is  scalability.  Ia  architactaral  terns,  scalability  has  two 

paraBal  computers  ere  those  ia  which  the  member  of  an  dm  (or  processors) 
caa  be  expanded  over  a  wM*  1 


m  (or  pro 

'ange  withoat  eebetaatial  rh  eagre  fa  either  the  existing  coaflgmratioa  or  software,  aad  the 
r  a  amber  of  nodes  iacreeem.  Ceneratina  scalability  is  the  adaptability  of  the  architect  are 

to  the  rapid  oeolatioa  haologjas.  The  hey  to  else  scalability  of  MFCs  is  the  iatarcoaaectioa  network  which  is 
also  a  deriding  factor  ia  t«mi  of  performance  wad  cost  of  the  entire  system.  Numerous  topologies  hare  been  explored 
far  pnrattsi  computers.  However,  the  lack  of  siae- scalability  of  soam  of  theae  networks  (e.g.,  hypercabe)  have  limited 
their  see  fa  MFCs  despite  their  auay  other  advantages.  We  have  explored  a  aew  network  topology,  called  Optical 
MmM-ilttk  Hypercabe  (OMMH),  which  combines  the  advantages  of  both  the  hypercabe  (small  diaaaater,  Ugh  connec¬ 
tivity,  symmetry,  simple  coetral  aad  ranting,  faah  tolerance,  etc.)  aad  the  mesh  (coaetaat  node  degree  aad  scalability) 
topologies,  while  dream veatfag  their  disadvantages  (lack  of  acafabffity  at  the  hypercabe,  aad  large  diameter  of  the 
meek).  We  have  also  developed  a  tkrse-dimeasioeal  (3-D)  optical  implementatioa  methodology  which  exploits  the 
advent  age  of  both  space- invariant  free  apace  aad  mnhiwavefangth  f  bar-baaed  optical  interconnects  technologies. 
DSfegfau  of  the  OMMH  network:  Aa  (!,m,a)-01fhiH,  where  l,m,aad,n  an  integers,  network  coneiets  of  1 X  m  X  2" 
nodes  sad  aa  address  of  a  node  has  three  components;  (i,j,k),  where  0  <  s’  <  1,  0  <  j  <  n»,  0  <  k  <  2",  and 
*,j,k  are  integers.  The  topology  of  aa  (i,m,  ni-OMMH  network  is  dsdaed  by  ive  iaterconaactkm  feactkms  far  a 
(»'>•*)“  /ha°”:  (})  A., (»•>.*)  =  ((•  +  1)  mod  I,  j,  *),  (2)  JLf(iJ,k)  ‘ =  ((|  +  i  -  1)  mod  1,  j,  k),  (S) 
/■.(•,.!.*)«(«,  0  +  1)  mo&  m,  *),  aad  (4)  £,(«,;,*)«(•,  (m+j-l)aod*,  *)  Ae(»,y,*^»  - •  •*e+ihe*r-i  -  *o)  = 
(ijtkn-i  •  •  ■  kj+ttjki-t  •••*#),  far  4  *  0,l,---,n-  1,  where  h«-i  •••*#  fa  a  biliary  representation  of 

fategm  k.  Tores  (nnl  with  wraparanad  conn  act  ions  fa  the  rows  aad  colamas)  connections  of  tha  OlfMH  network 
arc  dsdasd  by  A,,,  Apiaf  jt>,.  Tha  binary  n-enbn  iaterconaection  fa  dedned  by  J far  d  m  0,1, •••tn  -  1. 
Flg.1  shows  a  (2,4,3)-OMMH  interconnection  when  sold  Base  represent  hypercabe  Bake  aad  dashed  lines  represent 
tons  Bake.  For  a  network  afae  of  owe  mflHoa  nodes,  the  hypercabe  network  contain*  abont  10.5  millirm  Bake  while  the 


(f,m,4)-OMMH  has  abont  4.2  mflBoa  Bake  aad  (16, 16,*)-OMMH  has  approximately  8.4  million  Bake.  Since  oae  link 
impBae  oae  physical  path,  electrical  or  optical,  between  two  nodes,  the  OMMH  network  fa  cost-effident  compand  to 
tha  regains  hyperenba  network  fa  terms  it  hardware  tnanir— it.  It  shonld  be  noted  that  aa  OMMH  with  a  constant 
cnbe  as  a  basic  bnfldfag  block  (ex.,  (l,m,4)-OMMH)  has  coaetaat  node  degree  over  the  network  else.  This  feature 
Peebles  OlfMH  network  to  be  enJaMr,  that  fa,  the  rise  of  the  OMMH  caa  be  increased  by  expanding  the  sin  of  the 
tone  withoat  changing  node  degree. 

Optical  Implementation  oj  the  OMMH  aeftracfc  Aa  OMMH  network  fa  cnaetractad  from  simple  baildfag  blocks  (by- 
percabes)  fa  a  modular  aad  facraaaeatal  fashion.  Those  baBdfag  blocks,  once  coast  rmeted,  are  left  sadist  orbed  when 
the  network  grows  fa  else.  The  OMMH  caa  be  viewed  as  a  two-favel  interconnection  network:  high-density,  local 
connections  far  hypercabe  Hake  (within  a  bade  modal*),  aad  high  bit  rate,  low-density  aad  long  connections  for  the 


*TMs  watch  was  sappmtsf  far  an  HOT  great  He.  HOP  tllSMC,  aa  NSF  sraat  No.  MIP  M100S3,  aad  a  gnat  bam  USWast. 
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tana  Saks  connecting  the  baric  building  blocks.  Tie  optical  impiemeatatioa  alao  consist*  of  two  levels:  free- apace 
mm-iawiaat  optica  Cor  tie  coostrectioa  a f  beaic  beildiag  Mocks,  aad  meltiwavelaagtk  fiber*  lor  tie  tare*  limk*. 
Tie  break  doom  of  fractional  requirements  for  tie  OMMH  network  ia  consistent  with  tie  advantages  of  free- apace  aad 
optical  fiber  teckaofogiea.  The  aiae  of  the  OMMH  cam  be  iacraaaed  by  adding  hypercebe  modules,  which  provide*  mod- 
elarity  aad  atae-ecalahifity.  Caaaratioa-acalahility  ia  provided  by  the  eaa  of  high- bandwidth  wavelength  maitipiezad 
optic*  which  woeld  watch  coaaaiaaicatioa  bandwidth  r*qnir*wU  of  fotere  proraaaiag  *lm*et*. 

The  totally  apace- in  variant  optical  implementation  of  hypercebe*  ia  provided  ia  detail  in  Baft.  2  and  3.  Baaed  on 
thee*  method*,  an  (/,  m,  n)-OMUH  can  be  corut  ructed  aa  follow*:  (1)  /  x  m  n-cnbe  mod  alee  aa  deacribed  In  Kef.  2  aad 
3  are  placed  in  a  I  x  m  matrix  form.  (2)  /  x  m  node*,  each  of  which  ia  from  the  eame  location  of  th*  n-cabe  model**, 
at*  connected  to  form  a  tore*  of  dimeerioa  l  X  m.  (3)  Stop  2  i*  repented  util  every  node  i*  connected,  molting  ia  2" 
toman*  of  rise  /  x  m. 


Now,  we  consider  aa  optical  implementation  of  tie  (/,  m,  a)-OMMH  network.  We  nae  two  optical  componaata:  a 
qaadraat  beam  iplitter  (QBS)  which  tplit*  a  riagie  beam  into  four  beam*  (ti*  QBS  also  combine*  fonr  beam*  into  one 
riac*  it  ia  br-directionnli  aad  nn  r-ckaaari  wavefeagth  mahi plexor  (WMUX)  which  mmitiplexe*  beam*  with  « diCereat 
wavelength*  into  a  riagie  beam  (alao  demnltlplexe*  riace  it  i*  bi-directional).  We  aanme  that  each  nod*  has  two  light 
■onto**;  one  aonree,  £*,  to  generate  th*  req  aired  hypercebe  Unks  aad  tie  aacoad  aonre*,  S,,  i*  con  pled  with  u  optical 
fiber  for  tie  torn*  Maks.  A  QBS  i*  attached  to  every  St  to  provid*  tie  fonr  faaonta,  St„,  S,t,  S,M,  end  S,w  (north, 
■oath,  east,  aad  weat).  A  WMUX  i*  located  at  both  end*  of  each  row  aad  each  column.  Let  each  WMUX  at  tie  right 
ead  of  a  row  be  WMUX*,  each  WMUX  at  tbe  left  aad  of  a  raw  be  WMUXw,  each  WMUX  at  the  top  of  a  column  be 
WMUXiv,  aad  each  WMUX  at  tbe  bottom  of  a  colama  bo  WMUX*.  In  a  given  row,  a  WMUX*  maltiplexes  light* 
from  tie  Sta  aoarco*  of  that  row  into  a  single  fiber  which  ia  then  connected  to  a  WMUX*-  ia  the  neighboring  a-eabe 
module.  Similarly,  £t_a  5(j»,  and  5<lrs  are  multiplexed  by  WMUXjy,  WMUXs,  aad  WMUXtr,  roapectlvely.  Fig.2 
show*  an  (2,4,5)-OMMH  implementation.  More  detail*  will  be  provided  at  the  Meeting. 


Fig.  2:  (2,4f5)'OMMH  implementation. 

References 

1.  G.  Bed,  “Ultracomputera:  A  TeraBop  Before  Its  Time”,  ComnrantcaJion*  of  ACM,  99,  Aag.  1992,  pp.  27-47. 

2.  A.  Loan  aad  H.  Snag,  “Design  Methodology  for  Tkree-dimearioaal  Space- in  variant  Hypercebe  Networks  with 
Graph  Bipartitfoniag”,  Optics  Letter*,  19(23),  1993,  pp.  2050-2052. 

3.  A.  Loan  aad  H.  Song,  “Efficient  Implementation  M ethology  for  Three- dimensional  Space-invariant  Hypercube- based 
Optical  Interconnection  Networks”,  Applied  Optic*,  92(35),  1993,  pp.  7200-7209. 


WD4/225 
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Abstract:  A  3D-integrated  microsystem  is  presented  which  performs  the  overlay  of  data  planes  Each  channel 
of  the  microsystem  consists  of  two  gradient-index  microlenses  and  two  microprisms.  The  output  plane  is  500pm 
squared  and  contains  approx.  100  Pixels 


Introduction:  3D-integrated  microsystems  promise  a  higher  packaging  density  and  complexity  for  optical 
interconnects  and  functional  elements  compared  to  2D  OEICs  [1],  However,  the  additional  degrees  of  freedom 
demand  aAHtinwl  /•ffnruc  for  the  aligning  and  mounting  of  the  systems  This  is  critical  especially  few  diffractive 
optical  elements.  We  present  a  very  simple  refractive  system  performing  the  overlay  of  two  data  planes  which  is 
a  vital  step  in  symbolic  substitution  concepts  for  digital  optical  computing  [2]  [3], 

Experimental  Setup:  The  optical  design  of  the  system  is  straightforward.  To  perform  an  overlay  of  two  data 
planes,  imaging  and  deflecting  dements  are  needed.  It  would  alio  be  possible  to  build  a  system  containing  only 
lenses  if  those  are  used  off-axis.  However,  we  use  the  lenses  on-axis  and  shift  the  data  planes  with  prisms. 
Again,  there  are  two  possibilities.  One  is  to  use  only  a  single  lens  and  pupil  division,  the  other  to  use  two 
lenses.  The  second  option  has  the  advantage  that  the  whole  space-bandwidth  of  the  lens  can  be  used. 
Additionally,  the  area  where  the  two  prism  surfaces  meet  is  not  perfect  (rounded)  and  this  area  is  not  in  the 
center  of  the  lens  but  between  the  lenses.  In  all  three  cases  (Fig.  1)  it  is  necessary  to  adjust  the  illumination 
according  to  the  imaging  conditions. 


Fig.  1  Three  possible  designs  of  a  system  performing  the  overlay  of  two  data  planes 

The  focal  length  of  the  lenses  (1 100pm),  the  pitch  of  the  leases  (250pm)  and  the  refractive  index  of  the  prisms 
(1.49)  determine  the  prism  angle  (17°).  In  our  setup,  the  data  planes  are  illuminated  test  patterns.  We  use  two 
LEDs  to  be  able  to  swich  the  two  channels  independently.  The  LEDs  are  demagnified  by  a  microscope 
objective  and  imaged  in  the  censers  of  the  lenses.  The  output  plane  is  imaged  on  a  CCD  array  by  another 
microscope  objective. 


Fig.2  The  experimental  setup 

Fabrication:  The  lenses  were  made  by  use  of  the  Na-Ag  km  exchange  in  glass  [4].  The  prism  array  was  made 
by  thermal  moulding  and  casting  [5]  (Fig.  3).  Firstly,  a  glass  master  of  the  prams  was  made  using  standard 
techniques  The  master  was  heated,  tntboaaed  into  a  PMMA  substrat  and  cooled  down.  Repeating  this  process 
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a  regular  array  of  negative  prisms  The  negative  array  is  now  used  as  master  for  the  production  of  the 
BmI  prism  array.  For  this  purpose  it  is  filled  with  UV-cnring  glue  and  topped  with  the  planar  microlens  array. 
ShM  the  performance  of  the  system  primarily  depends  on  the  prism  angle,  the  alignment  of  the  negative 
microprism  array  and  the  microlens  array  is  not  very  critical  apart  from  a  relative  rotation.  After  curing  of  the 
glue,  die  negative  is  separated  from  the  complete  system. 
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Fig.3  Fabrication  of  the  microprism  array 

Experimental  results:  Fig.  4  shows  a  top  view  of  an  array  of  microprisms  fabricated  on  an  substrate  which 
already  is  an  array  of  planar  microlenaes  Fig.  5  and  6  show  images  of  the  both  dataplanes  independently. 
Fig.  7  shows  the  overlay  of  the  two  dataplanes 


X  X 

■•vX-. 

1  .„. 

i;  X  x 

A 

J  ft  X 

:n 

1 ...  -. ■ 

f  _ ] 

Acknowledgements:  The  microlenses  were  made  by  J.  Bur.  The  help  of  W.  Eckert  and  W.  Singer  with  the 
experimental  setup  is  gratefully  acknowledged.  Parts  of  this  work  are  funded  by  the  german  ministry  of 
research  and  technology  (BMFT)  under  grant  TK  0585/5. 


References: 

[1]  F.B.  McCormick,  " Optical  Hardware  Design  for  Free-Space  Optical  Computing  and  Switching ", 
Proceedings  of  16th  Congress  of  ICO  1993,  Budapest,  p.  352-356 

[2]  K.-R  Brenner,  A.  Huang,  N.  Streibl,  " Digital  optical  computing  with  symbolic  substitution",  Appl.  Opt. 
25(18),  p.  3054-3060  (1986) 

[3]  K.-H.  Brenner,  W.  Eckert,  C.  Passon,  "Demonstration  of  a  systolic  array  optical  adder  based  on  symbolic 
substitution",  accepted  by  Optics  an  Laser  Technology  Jan.  1994 

[4]  K-R  Brenner,  J.  Moisei,  W.  Singer,  S.  Sinzinger,  T.  Spick,  M.  Testorf:  "Diffusion  elements  in  glass: 
comparison  and  optimization  of  the  diffusion  response  in  different  substrates".  Proceedings  1992  ICO  Topical 
Meeting  on  Optical  Computing.  Minsk,  p.  234-242 


[5]  K.-R  Brenner,  "Three  dimensional  microoptical  integration  techniques".  Proceedings  of  16th  Congress  of 
ICO  1993,  Budapest,  p.  98-104 


WD5/227 


INTEGRATION  OF  FREE- SPACE  INTERCONNECTS  USING  SELFOC  LENSES: 

OPTICAL  PROPERTIES  OF  A  BASIC  UNIT 

Kenjlro  HAMANAKA.  Kenichl  NAKAMA.  Daisuke  ARAI .  Yukihlsa  KUSUDA. 

Takashi  KISHIMOTO.  and  Yoshinobu  MITSUHASHI 
RWCP*  Optoelectronics  NSG  Laboratory^ 

A  vertical  and  horizontal  interration  technique  of  free-space 
Interconnects  using  Selfoc  lenses,  is  described.  A  basic  unit  of  the 
optical  Bother  board  has  been  fabricated  by  slicing  a  Selfoc  rod  of 
4 mm  in  diaaeter. 

1 . INTRODUCTION 

In  case  of  integrating  free-space  optical  systems,  surface 
mounting  techniques  are  preferable  for  fixing  non-transparent  devices 
such  as  surface  emitting  LDs  and  ref lect ion- type  SLMs.  while 
transparent  devices  such  as  transparent-type  SLMs  and  Isolators  must 
be  inserted  in  optical  paths.  Therefore,  it  is  required  to  develop 
optical  mother  boards  for  Integrating  free-space  optical  systems, 
which  are  suitable  for  both  non-transparent  and  transparent  devices. 
It  is  considered  that  "Optical  bus  interconnection  system  (OBIS) 
[1,2]",  an  optical  mother  board  using  Selfoc  microlenses  (SMLs)  [3]. 
is  one  of  the  promising  solutions. 

2.  PRINCIPLES 

Figure  1  shows  the  basic  concept  of  OBIS.  An  optical  mother 
board  owning  many  optical  I/Os  on  a  glass  substrate  as  well  as  in  the 
substrate  is  composed  of  cascade  arrays  of  SMLs  and  prisms.  All  the 
I/Os  are  basically  conjugated  each  other,  and  as  the  result,  two 
dimensional  Information  generated  at  one  of  the  I/Os  is  transferred 
into  the  other  I/Os  through  the  image  transmission  tubes  composed  of 
the  SMLs.  The  perfectly  coaxial  arrays  of  SMLs,  which  are  fabricated 
by  slicing  Selfoc  rods  using  diamond  blades,  provide  large  space- 
bandwidth  product.  Non-transparent  devices  can  be  Integrated 
horizontally  at  the  optical  I/Os  on  the  substrate  using  surface 
mounting  techniques,  while  transparent  devices  can  be  integrated 
vertically  at  the  I/Os  In  the  gaps.  The  substrate  plane  Including  the 
optical  I/Os  can  also  be  used  for  electronic  circuits  as  well  as 
planar  optical  waveguides.  Various  interconnection  networks  can  be 
realized  using  microlens  arrays  [4]  or  DOEs  in  the  gaps. 

3.  EXPERIMENTS 

A  basic  unit  of  OBIS,  by  means  of  a  4-f  telecentric  imaging 
system,  was  fabricated  by  slicing  a  Selfoc  rod  of  4mm  in  diameter 
(Fig. 2).  After  the  rod  and  two  glass  substrates  with  grooves  were 
fixed  using  UV-resln,  perpendicular  gaps  were  fabricated  using  a 
diamond  blade  composed  of  diamond  particles  of  20-30pm  in  diameter 
(Fig. 3).  Roughness  of  the  SML  facets  and  the  substrate  planes  was 
measured  to  be  Rmax-2.2)im  in  average.  The  4-f  telecentric  imaging 
system  owing  two  optical  I/Os  on  the  substrate  was  formed  after 
combining  two  prisms  using  index-matching  liquid  (Fig. 4).  The  optical 
configuration  has  been  designed  to  minimize  aberration  within  the 
image  height  of  0.8mm,  while  NA  of  the  light  beams  is  equal  to  0.1. 
Point  sources  of  the  corresponding  NA,  i.e.  focusing  spots  of  an 
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alaost  diffraction  Halted  planar  alcrolens  array  of  NA-0.1  [S].  were 
located  at  the  object  plane,  and  PSFs  were  observed  under  coherent 
llluainatlon.  The  PSFs  of  16 pa  In  dlaaeter  In  average  with  no  critical 
scattering  were  obtained  In  the  1.6aa  square  laage  plane  on  the 
substrate  through  the  reflection  path  (Flg.S)  as  well  as  through  the 
transparent  path. 


A  vertical  and  horizontal  Integration  scheae  of  OBIS  has  been 
proposed  and  a  basic  unit  of  OBIS  fabricated  by  slicing  a  Selfoc  rod 
of  4aa  in  dlaaeter  has  been  exaalned. 
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Abstract 

We  describe  a  technique  for  assembling  fiber  arrays  as  needed  in  optical  computing  and  photonic 
switching.  A  4x8  array  was  manufactured  with  fiber  ends  to  within  1.5  pm  from  their  ideal  position 
and  to  a  pointing  precision  of  30  arc -minutes. 

Summary 

Optical  fiber  bundle  mays  are  simple  to  conceptualize  but  their  fabrication  has  proven  to  be  difficult 
when  high  precision  positioning  is  required.  Several  fabrication  techniques  have  been  reported  in  the 
literature;  for  example,  in  die  approach  presented  by  Miller*11  a  2-D  array  of  fibers  was  made  by 
slacking  a  number  of  linear  arrays  of  fibers  supported  by  grooved  spacers.  These  spacers  were 
manufactured  by  the  precise  etching  of  both  sides  of  a  silicon  wafer,  and  a  polishing  operation  was 
involved  after  potting  all  die  fibers  in  place.  Recently,  this  technique  has  been  used  by  Danzer, 
Kipfer,  Ztrl,  Lindotf,  and  Sch wider®  to  assembly  a  fiber  array  with  a  maximum  positioning  error  of 
10  pun.  In  another  effort  an  alignment-free  assembly  technique  has  been  developed  by  Sasaki,  Baba, 
and  Igapl  where  fiber  end  posiconing  was  accompihhed  to  within  ±  8  pun.  In  this  technique  an  array 
of  fiber  sockets  with  centering  plugs  were  micro-fabricated  to  achieve  fiber  self-centering  insertion 
and  to  expedite  assembly.  Koepf  and  Marfcey*41  have  reported  a  technique  involving  arrays  of 
precision  holes  in  substrates  to  insert  and  locate  optical  fibers  with  a  standard  deviation  of  12.6  pm. 
Arrays  of  precision  holes  to  position  fibers  have  also  been  used  by  Basavanhany*51  and  by  Proudley, 
Stace,  and  White*61  to  manufacture  fiber  bundles.  Hie  insertion  of  fibers  in  microferrules  and  their 
stacking  to  create  a  2-D  fiber  array  has  even  been  realized*71  with  a  mean  fiber  positioning  error  of  3 
pun. 

A  common  feature  of  these  techniques  is  that  fiber  positioning  is  accomplished  by  referencing  the 
fiber  to  a  mechanical  jig  and  this  limits  the  ultimate  precision  attainable.  We  considered  these 
assembling  techniques  but  were  not  satisfied  with  their  inherent  errors  or  the  processes  involved.  Thus 
we  developed  the  assembly  technique  that  we  present  which  suits  our  photonic  switching 
development  The  basic  idea  around  which  our  technique  revolves  is  the  high  precision  and 
individual  positioning  of  each  optical  fiber  on  a  substrate  of  large  and  low  precision  holes.  Instead  of 
relying  on  a  mechanical  substrate  to  reference  the  fiber  we  obtain  positioning  accuracy  by  individually 
looting  each  fiber  core  in  the  correct  position.  The  positioning  accuracy  is  achieved  by  referencing 
fight  coming  from  the  fiber  core  to  a  lithographically  made  array  of  annuluses  or  doughnuts  on  a 
transparent  glass  substrme.  Once  a  fiber  is  positioned,  it  is  bonded  in  place  by  curing  UV  cement 
around  it  The  array  of  holes  serves  as  the  foundation  for  the  fiber  array,  to  achieve  uniformity  in  fiber 
pointing,  and  to  sttern  relieve  the  optical  fibers. 

To  assemble  a  fiber  bundle  a  bole  array  substrate  is  filled  with  UV  curing  cement  and  an  array  of 
centering  donghnmi  is  registered  on  the  top  surface  of  the  hole  array.  The  centering  doughnuts  face 
Iheamy  of  holes  and  are  centered  with  respect  to  the  holes.  The  assembly  formed  by  the  hole  array 
and  centering  doughnuts  substrate  is  placed  on  a  microscope.  Prior  to  fee  fiber  insertion  and  alignment 
steps  the  fiber  coating  is  stripped  off  and  the  fibers  cleaved.  With  the  aid  of  a  fiber  manipulator  a  fiber 
is  grabbed  amt  inserted  through  the  correspondmg  hole  in  the  hole  array.  The  light  that  comes  from 
the  fiber  core  and  tome  TV  cameras  make  this  insertion  easy.  The  fiber  can  pivot  on  the  lower  rim  of 
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the  hole  so  (hat  it  can  be  aligned  by  horizontally  moving  the  fiber  manipulator.  Then  the  fiber  is 
brought  into  contact  with  the  doughnut  substrate.  For  this  delicate  step  a  piezoelectric  driven 
mechanism  is  used.  Once  the  alignment  has  been  done  a  pinhole  is  positioned  on  top  of  the  fiber  core 
to  only  allow  the  cement  in  the  hole  of  interest  to  be  exposed  to  UV  light  At  this  time  the  fiber 
position  is  verified  and  then  a  UV  light  gun  is  shone  on  top  of  the  pinhole  to  cure  the  cement  and  bond 
the  fiber  to  the  hole  and  substrate.  The  bond  holds  the  fiber  in  position  and  also  helps  to  stress  relieve 
the  glued  fiber  end  from  the  rest  of  the  fiber.  Other  fibers  are  glued  in  a  similar  fashion  and  in  an 
orderly  way  to  avoid  interference  between  the  optical  fibers  as  they  are  handled  and  moved  under  the 
hole  stray  substrate.  Physical  interference  between  fibers  is  avoided  by  slightly  bending  the  fibers 
after  gluing  so  that  all  the  fibers  are  moved  out  of  the  way  of  the  next  fiber.  The  elapsed  time  between 
each  fiber  alignment  and  gluing  is  approximately  10  minutes.  Using  the  technique  described  we 
manufactured  several  fiber  arrays.  The  most  recent  one  has  32  single  mode  fibers  at  1300  nm  in  a  4x8 
array  and  a  distance  of  500  pm  between  fiber  centers.  The  fiber  ends  were  located  to  within  1.5  tun 

from  their  ideal  position  and  to  a  pointing  precision  of  30  arc-minutes. 

• 

We  have  developed  a  technique  for  assembling  fiber  bundle  arrays  with  unproved  fiber  positioning 
accuracy.  The  technique  does  not  depend  on  fiber  core  concentricity,  on  fiber  dimensional  uniformity, 
dr  on  a  final  polishing  step.  Except  for  the  use  of  lithography  for  fabricating  the  centering  doughnuts 
all  the  equipment  necessary  to  implement  this  assembly  technique  is  simple  and  is  easily  obtained. 
We  use  active  fiber  alignment  to  center  illuminated  fiber  cores  within  lithographically  made 
referencing  doughnuts  to  achieve  micrometer  precision  in  fiber  end  positioning.  In  addition,  the  array 
of  centering  doughnuts  makes  the  detection  of  alignment  errors  straightforward  and  limited  by  the 
microscope  resolution.  The  assembly  method  is  very  well  suited  for  die  manufacture  of  small  arrays 
and  for  the  development  of  optical  computing  and  photonic  switching  systems  where  only  a  few  fiber 
arrays  are  required  but  with  changing  requirements.  Some  drawbacks  of  the  technique  are  the 
assembly  time  required  to  construct  large  arrays,  and  that  it  takes  only  one  broken  or  mispasitioned 
fiber  to  rain  an  entire  array.  This  latter  drawback  is  common  to  all  techniques.  Thus  fiber  assembly 
reliability  and  yield  are  important  figures  of  merit  to  compare  assembly  techniques.  For  future  work 
we  plan  to  automate  the  technique  to  decrease  assembly  time  and  increase  reliability.  We  also  plan  to 
manufacture  larger  fiber  arrays,  study  fiber  array  repairability,  dimensional  stability,  and  incorporate 
arrays  of  miciolenses  to  match  the  numerical  aperture  of  the  fiber  bundles  to  that  of  our  photonic 
systems. 

We  would  tike  to  thank  A.  L.  Lentine,  R.  L.  Morrison,  F.  B.  McCormick,  T.  J.  Cloonan,  R.  F. 
Huisman,  R.  J.  Crisci,  G.  M.  Pioudley  and  M.  R.  Taghizadeh  for  helpful  discussions. 
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ABSTRACT 

fteo-ggce  optical  schemes  lo  minimize  the  number  of  switches  far  noo-htodriag  mchicsst  and 
braadcat  ilBrooinectsppiiaaioM  are  introduced  and  CTpcrimeMallydanoBttmBd  baaed  on  the 


A  nonrblodring  generalized  switching  network  erf  N  nodes  is  die  one  where  each  input 
can  broadcast  or  multicast  its  message  to  any  combination  of  output  nodes  without  experiencing 
any  internal  blocking.  In  case  that  one  input  is  to  be  multicast  to  j,  j  <  N,  outputs  the  remaining 
N-j  output  nodes  can  still  be  accessed  by  any  of  die  remaining  input  nodes  without  experiencing 
internal  blocking.  Using  this  criteria,  most  popular  point-to-point  switching  networks  do  not 
qualify  20  handle  non-bloclting  communications  for  die  generalized  switching  applications. 
Using  die  space  switching  concept  a  cross-bar  network  is  about  the  only  one  which  handles 
strictly  non-blocking  general  purpose  networking.  A  cross-bar  has  to  use  N2  basic  switches 
making  die  hardware  implementation  of  a  large  network  a  very  difficult  male.  Hie  reduction  of 
switching  complexity  far  a  general-purpose  striedy  non-blocking  network  is  not  possible  using 
a  purely  ID  switching  technique  since  such  a  network  has  to  make  available,  at  each  of  its  N 
output  nodes,  all  its  N  input  signals  so  that  any  output  node  can  independendy  select  any  input 
signal. 


In  our  approach,  instead  of  a  purely  ID  switching  the  multiple  multiplexing  techniques 
are  used  [1].  Hoe,  the  multiple  multiplexing  implies  that  the  explicidy  multiplexed  signals  in  he 
time,  wavelength,  or  space  domain  need  to  be  multiplexed  one  on  top  of  the  other  again.  Let  us 

assume  for  a  general  lb-tuple  multiplexed  system  of  N  nodes  with  N  =  to  H2  •••  Wk,  each  has  a 
unique  identity  composed  of  k  indices.  Since  each  of  die  N  receiving  nodes  receives  entire 
inform  from  all  the  input  nodes,  to  select  an  input  at  an  output  node,  the  receiver  has  to  access 
each  of  the  t  dimensions  and  to  select  (by  demultiplexing)  the  corresponding  channels. 

Selecting  a  specific  sub-channel  in  dimension  d  is  equivalent  to  a  complex?  /  of  0(^1),  r.e. 
he  number  of  basic  2x2  switches  is  OiHd-1),  and  for  JV  users  he  total  switch  count  is 
1)N).  Thus,  the  total  complexity  of  the  network  is  O  [(m  +  fi2  +...  +m~k)NJ.  Since  all  m 
(i  =  12  ...  k)  are  symmetrical,  a  minimum  complexity  will  occur  when  aU  w  are  identical 

Thus,  a  minimum  complexity  of  kN(NI/k-l)  is  obtained  for  m  ■  P2  * * Mk  mNIlk.  The 
overall  complexity  can  be  reduced  by  increasing  k,  however,  since  die  minimum  number  of 
sub-channels  in  any  dimension  is  two,  no  more  than  logjN  dimensions  can  be  used  for  a  total 
of  N  signals.  Thus,  for  *  =  log2N,  the  complexity  approaches  the  absolute  limit  of  Nhg2N 
which  ties  the  information  lower  bound  pointed  out  by  Shannon  [2].  Using  electronics,  a  k-2 
system  (time,  and  space)  can  be  implemented.  The  use  of  optical  fiber  components  allows  k=3 
with  an  additional  wavelength  dimension.  When  the  free-space  optics  is  used,  angular 
dimensions  in  two  perpendicular  planes  as  extra  degrees  of  freedom  can  be  included  mating  a 
k=5  system  feasible.  One  additional  advantage  of  he  multiple  multiplexing  is  that  instead  of 
squeezing  all  complexity  into  a  single  dimension  making  the  system  working  at  its  extreme 
limit,  each  dimension  handles  its  fair  share  of  switching  complexity  at  its  comfortable  range. 
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To  confirm  the  proposed  multi¬ 
dimensional  switching  concept,  an  N=9, 
k*2  system  utilizing  two  mutually 
orthogonal  angular  dimensions  is 
sketched  in  Fig.l  far  its  top  and  side 
views.  For  the  broadcast  and  select 
operations,  respectively,  a  2D  Dammann 
grating  and  two  ID  optical  deflector  arrays 
are  employed.  The  Dammann  grating  splits 
an  input  beam  into  9  angularly  encoded 
beams  and  multiplexes  them  with  the  help  of 
a  spherical  lens  into  9  spatial  spots  to  be 
selectively  switched  by  the  two  beam 
deflector  arrays.  The  first  and  second 
deflector  arrays  select  one  out  three  angles 
along  the  XZ  and  YZ  planes,  respectively. 
In  this  way,  instead  of  using  81  switching 
states,  only  2x9x3  =  54  switching  states  are 
used.  The  larger  tire  N,  the  mare  savings  of 
the  switching  states. 

The  angular  selective  switching  can  be  implemented  using  a  multichannel  A-0  deflrytor 
array.  Since  the  number  of  angular  channels  is  fundamentally  related  to  spatial  and 
frequency  resolution,  various  technolo¬ 
gical  and  fundamental  limitations  restrict 
the  network  interconnect  capacity.  We 
have  performed  calculations  of  limitation 
on  such  a  capacity  based  on  parameters 
of  tire  multichannel  A-0  deflectors.  Fig.2 
plots  the  network  capacity  limits  due  to 
the  A-O  deflector's  time-bandwidth 
product,  the  grating's  space-bandwidth 
product,  and  the  interfacing  numerical 
aperture,  where  the  parameters  are:  fj, 
the  focal  length  of  the  lens;^,  tire  central 

frequency  of  the  A-O  deflector.  X,  tire 
optical  wavelength,  z  and  h,  tire  thickness 
and  wir  th  of  the  transducer  of  each 
deflecta  and  s,  the  spacing  between  two 
consecutive  deflectors.  Mare  than  32 
angular  channels  can  be  used  in  each 
dimension  making  a  system 
interconnecting  more  1,024  nodes 
possible. 
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Fig  .2:  Interconnect  capacity  limit  due  to  fundamental 
A  technological  constraints. 
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Fig.l:  Top  and  side  views  of  a  free-space  insercoo- 


nect  using  double  angular  multiplexing. 


Experimental  verification  of  the  proposed  k=2  system  was  performed  using  two  8- 
channel  TeOj  A-0  deflector  arrays  fabricated  by  Brimrose  [3].  In  the  proof-of-principle 
experiment,  the  N«64  system  was  built  and  tested.  Experimental  conditions  and  results  will  be 
presented  at  tire  conference. 


[1]  J.  Sharony,  T.  Stem,  and  Y.  Li  "Universality  of  the  multidimensional  switching 
networks,"  IEEE! ACM  Tran.  Network,  submitted. 

[2]  C  E.  Shannon,  "Memory  requirements  in  a  telephone  exchange,"  Beil  System  Tech.  J., 
29(1950)343-349. 

[3]  Y.  Li,  et  aL,  "Minimum  complexity  free-space  optical  nonblocking  networks  for 
multicast  interconnect  applications,"  Opt.  Lett.,  19  (1994)  to  appear. 
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We  discuss  interconnection  networks  applied  to  a  general  parallel  computing  architecture.  Several  criteria  enable 
us  to  choose  a  specific  network.  An  optoelectronical  system,  is  proposed.  We  present  an  optical  model  and  the 
corresponding  experimental  results. 


Photons  and  matter-photons  interactions  are  used  in  communications,  interfaces  and  processing 
inside  digital  parallel  computers.  Parallelism  involves  a  great  number  of  elementary  processing 
units  (PEs)  working  together.  Communications  between  these  PEs  or  to  the  external  world  are 
primordial  for  the  global  efficiency  of  the  computer.  For  a  given  number  of  PEs,  communica¬ 
tions  have  an  influence  on  the  data  flow  and  the  degree  of  freedom  which  is  necessary  to 
implement  a  suitable  interconnection  pattern. 

A  theorical  analysis  of  different  types  of  interconnection  networks  and  the  experimental  results 
of  a  particular  hP-^N22  network  are  presented.  In  such  an  architecture,  the  applications  which 
can  be  supported  may  be  as  varied  as  data  base,  image  and  signal  processing,  matrix  operations. 

The  needs  of  data  movements  required  by  these  applications  enable  us  to  extract  a  communi¬ 
cation  kernel :  we  discuss  its  compatibility  with  free-space  optical  technology.  According  to  the 
optical  SBWP  (Space  Bandwidth  Product)  parameter,  a  bidimensional  optical  multistage  in¬ 
terconnection  network  is  achievable  even  if  the  degree  of  parallelism  is  high.  Moreover,  one 
system  performs  all  wanted  communications.  Multistage  network  is  composed  of  several  stages 
of  fixed  long  link  patterns  combined  with  dynamic  short  link  patterns  (the  switch  element  ar¬ 
ray).  In  our  approach,  the  fixed  long  link  patterns  would  be  optical  and  the  switch  element  array 
would  be  implemented  with  electronic  technology  which  is  efficient  for  neighbouring 
connections. 

Different  attractive  patterns  of  multistage  networks  are  compared  according  to  the  following 
criteria :  _  to  minimi  ze  the  number  of  crossing  through  the  network  for  the  desired 

communications; 

_  to  achieve  the  global  reconfiguration  with  a  SIMD  process  i.e.  a  single 
instruction  configures  all  switches  of  a  same  stage. 

The  classical  multistage  networks  for  our  applications  seem  not  to  respond  to  our  need,  so  we 
have  developed  a  specific  one.  It  is  a  bidimensional  Omega  /  shuffle-exchange  network  with 
additional  neighbouring  connections  between  nods  in  a  same  stage.  For  N2  inputs,  it  consists  of 
log2(N)  stages  with  all  same  fixed  long  link  pattern  corresponding  to  a  bidimensional  perfect 
shuffle. 

Different  network  architectures  were  considered  depending  on  : 

_  the  network  configuration  process  ; 

_  the  alternative  between  a  implementation  with  log2(N)  same  physical  stages  or  a  sin¬ 
gle  physical  stage  implementation. 
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We  compare  the  different  networks  accor¬ 
ding  to  the  following  parameters  :  the 
network  configuration  time  ;  the  data  trans- 
fert  time ;  the  number  of  optical  connections. 
We  take  also  into  account  the  capability  of 
the  network  to  be  bi-directional  as  opposed 
to  a  mono-directional  ring  structure.  We 
have  finally  chosen  the  architectural  option 
including  a  monostage  N2— >N2  network  and 
a  network  configuration  process  using  a 
fixed  broadcasting  1— network  .  In  this 
option,  PEs  are  interconnected  with  an  elec¬ 
trical  mesh  network  so  that  they  simulate  the 
switch  element  array.  Optical  2D  perfect 
shuffle  network  and  the  optical  broadcasting 
1-»N*  network  are  connected  to  the  proces¬ 
sing  unit  array  (see  figure  1). 

After  this,  we  wiP  discuss  different  optical 
means  which  allow  to  implement  fixed  long  link  patterns.  Our  goal  was  to  estimate  the  deflec¬ 
tion  limitations  of  these  different  teenies  and  verify  if  they  are  able  to  produce  a  highly  parallel 
2D  perfect  shuffle  pattern. 

The  interfaces  between  the  N2  processing  units  and  the  optical  fixed  long  link  pattern  is  com¬ 
posed  of  source  and  detector  arrays.  The  emitted  beams  are  supposed  gaussian  and  they  are 
collimated  and  focused  by  microlens  arrays.  Refractive  and  Diffrative  optics  are  possible  to 
perform  the  needed  deflections  :  a  microprism  array  ;  a  micrograting  array  ;  a  CGH  producing 
both  the  deflection  and  microlens  functions  (example,  shifted  Fresnel  microlens). 

We  have  analysed  theoretically  the  deflec¬ 
tion  function  based  on  a  microprism  array 
and  implemented  an  experimental  setup  to 
validate  this  concept.  The  experiment, 
described  in  figure  2,  consists  in  : 

_  a  pigtailed  laser  diode  with  a  fiber  core 
diameter  of  5  pm  to  simulate  a  VCSEL 
(Vertical  Cavity  Surface  Emitting  Laser 
I  diode) ; 

_  two  103x103  250  pm  diameter  micro¬ 
lens  arrays ; 

_  a  glass  flat  which  can  be  rotated  to  si- 
Figure  2 :  basic  experimental  bench  to  study  the  initiate  different  angle  prisms  ; 
deflection  system  using  a  microprism  array.  _  a  CCD  camera  for  detection. 

The  experimental  results  were  in  agreement  with  the  theoretical  model.  The  theorical  results  are 
cheering  and  show  that  a  256  x  256  perfect  shuffle  network  could  be  done. 

We  have  presented  results  about  a  deflection  system  using  microprisms.  However  a  refractive 
microprism  array  having  a  particular  angle  for  each  elementary  prism  seem  not  to  be  easy  to 
produce.  Diffractive  elements  will  probably  be  more  compatible  with  planar  electronic  inte¬ 
grated  technology. 


microtans  microtons 


Figure  1 :  our  selected  architecture. 
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Abstract:  We  present  the  design  and  implementation  of  an  optical  ring  array  free  space 
interconnect  The  system  was  successfully  operated  at  a  data  rate  of  500  MHz  and  with  a  total 
system  throughput  of  8  Gbps. 

Summary:  Electronic  interconnections  have  been  recognized  as  the  bottlenecks  of  high 
performance  computing  systems  Because  of  their  3-D  nature  and  their  matched  impedance, 
optical  free  space  interconnects  are  the  best  alternative  to  electronic  counterparts.  Massively 
Parallel  Machines  (MPMs)  require  that  the  interconnect  latency  between  processors  be  kept  as 
small  as  possible.  For  this  purpose,  we  have  developed  optical  interconnects  using  ring 
topologies  [1], 

The  principle  of  a  Nearest  Neighbor  ring  topology  is  depicted  in  Fig.l.  Fig.  1(a)  shows  a  NN 
interconnect  based  on  a  rectangular  grid  topology  Hie  major  problem  is  that  the  interconnection 
length  between  processors  is  not  the  same  on  the  boundary  as  in  the  central  region.  This  unequal 
interconnection  length  will  cause  system  latency.  The  problem  becomes  severe  when  the  total 
number  of  processors  is  increased.  By  arranging  the  processors  in  a  ring  as  shown  in  Fig.  1(b), 
this  boundary  effect  can  be  eliminated.  Other  rectangular  grid  topologies  such  as  PM2i  and 
Hypercube,  can  be  implemented  in  a  similar  fashion.  In  addition,  operations  required  by  the  ring 
topologies  can  be  implemented  by  performing  simple  image  rotations. 

Our  experimental  system  is  shown  in  Fig.2.  It  consists  of  a  data  generator,  an  input  ring 
assembly,  a  four-channel  free  space  interconnect,  and  an  output  ring  assembly.  The  input  ring 
assembly  consists  of  16  laser  transmitters;  each  being  individually  modulated  by  a  500  MHz  PN 
data  sequence  generated  by  a  data  generator.  In  the  four  channel  interconnect,  each  channel  is 
designed  to  perform  one  of  the  four  operations  required  by  the  NN  interconnect  To  demonstrate 
channel  switching  operations.  Liquid  Crystal  (LQ  switches  (sl-s5)  were  employed  in  each 
channel.  A  single  GaAs  receiver  was  employed  at  the  position  of  the  output  ring  assembly.  Data 
sequences  for  the  four  interconnect  channels  were  received  sequentially  by  setting  the  proper 
state  of  each  LC  switch.  The  received  data  sequence  was  then  compared  with  that  of  die  input 
The  experimental  results  showed  that  the  data  sequences  corresponding  to  the  NN  interconnect 
woe  received  correctly  fra  all  16  transmitters.  We  also  demonstrated  that  other  topologies  such 
as  PM2i  and  Hypercube  can  be  implemented  by  performing  different  rotation  operations.  This 
novel  interconnect  using  space  invariant  optical  elements,  having  identical  interconnect  latency. 
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and  allowing  network  reconfigurability,  will  be  suitable  for  high  speed  Massively  Parallel 
Machines  such  as  SIMD  machines, 

Refteen©^ 
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distribution  topology  for  optical  interconnects,"  Appl  Opt.  31,  5548-555?  (1992) 
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(c)  (d) 


Fig.  3  Experimental  results  of  the  Nearest  Neighbor  interconnect.  The  pulse  width  is  2  ns.  (a) 
corresponds  to  i+4  interconnect;  (b)  corresponds  to  i+1  interconnect;  (c)  corresponds  to  i-4 
interconnect;  (d)  corresponds  to  i-1  interconnect.. 
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ABSTRACT 

The  advantages  of  a  spatial  light  modulator  based  system  for  reconfigurable  optical 
interconnections  in  parallel  processing  applications  are  discussed.  The  design  of  the  system  and 
experimental  results  are  described. 

L  INTRODUCTION 

Optical  interconnections  have  many  advantages  over  electronic  connections  in  computing  and 
communication.  In  addition  to  these  advantages  a  degree  of  programmability  of  die  optical 
interconnections  can  provide  further  benefits.  In  tins  paper  we  concentrate  on  interconnections 
in  optical  computing  systems  using  nematic  liquid  crystal  spatial  light  modulators  (LC-SLMs). 
These  devices  have  been  chosen  for  their  low  cost,  easy  availability,  and  straightforward 
configuration  by  interfacing  of  the  drive  electronics  to  host  computers. 

2.  ROLE  OF  RECONFIGURABLE  OPTICS 

Reconfigurable  interconnections  have  several  advantages  over  fixed  interconnection  stages.  In 
particular  they  allow  the  maximisation  of  the  perl  Trance  gains  of  non-local  optical 
interconnections  [1  ]  over  a  range  of  algorithms  inherent  in  practical  image  processing 
They  may  also  be  used  to  provide  acceleration  of  individual  algorithms  by  changing  die 
processor  interconnection  harness  during  run-time  of  a  program. 

We  have  investigated  how  reconfigurable  interconnects  may  be  employed  over  a  small  range  of 
tasks  including  image  correlation,  noise  removal  and  enhancement,  FFTs  and  sorting.  In  this 
study  we  have  used  such  interconnects  as  we  have  experimentally  configured,  including  nearest 
neighbour,  next-nearest  neighbour,  image  dilation  and  various  shuffle  connections.  In  particular 
we  have  built  a  set  of  optimised  tasks  which  may  be  expected  to  run  on  our  own  demonstrator 
hardware  containing  a  reconfigurable  plane.  This  enables  us  to  generalise  die  functionality  of 
our  optical  processing  modules  while  maintaining  the  performance  advantages  of  the  optics, 
thus  increasing  the  overall  performance  of  tbe  system.  Numerical  results  obtained  by  simulation 
of  our  reconfigurable  architecture  on  a  distributed  array  processor  (DAP),  allow  comparison 
with  non-reconfigurabie  computing  schemes. 

X  DEMONSTRATION 

Optical  interconnections  may  be  classified  into  two  categories:  space-invariant  and  space- 
variant  Although  both  of  those  have  useful  applications,  die  latter  is  highly  desirable,  because 
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of  its  ability  to  implement  arbitrary  interconnections.  We  have  already  demonstrated  fan-out 
beam  array  generation  with  a  multi-level  phase  LCSLM  [2  ]  as  well  as  a  dynamic  perfect 
shuffle  interconnection  [3  ].  The  present  system,  although  limited  to  binary  phase  modulation, 
uses  a  LCSLM  with  higher  resolution  and  smaller  pixel  size  than  die  previous  device.  The  pixel 
sire  of  31pm  is  currently  die  smallest  available  for  this  type  of  device.  In  comparison  to  die 
iwintnmmi  feature  size  of  fixed  holographic  interconnections  manufactured  by  lithographic 
nwrfmiqiMg,  this  is  relatively  large.  The  large  pixel  size  limits  the  design  freedom  of  the  phase 
pattern  as  well  as  producing  a  small  diffraction  angle.  The  consequences  for  the  physical  design 
of  a  system  have  previously  beat  discussed  [3,4  ]. 

We  shall  present  results  of  various  interconnection  topologies  which  have  been  implemented  in 
the  reconfigurable  optical  system  [3 ,6  ].  These  topologies  include  simple  fen-out  in  1 
dimension,  nearest  neighbour  interconnections,  nonlocal  interconnections  and  array  generation. 
Interconnection  topologies  such  as  these  are  required  for  various  parallel  and  image  processing 
algorithms.  We  shall  discuss  these  algorithms  and  describe  the  advantages  obtained  by 
providing  a  degree  of  programmability  of  the  interconnects. 
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ABSTRACT 

A  optical  package  of  reconf igurable  free- space  optical  interconnection 
switching  network  for  Multiprocessor  systea  has  been  presented.  A  separated 
(2,2,2)  node  switch  type  consisted  of  two  space  light  aodulators  SEEDs  is 
presented  as  nodes  of  network  systea,.  Optical  aodule  of  one  stage  in  optical 
switching  network  of  8  x  8  channels  is  denonstrated  and  tested. 

Key  Words:  Optical  coaputing,  optical  interconnection 

SUMMARY 

The  interest  has  recently  grown  in  reconf igurable  optical  interconnects 
and  optical  switching  networks  for  aultiprocessor  and  telecoaaunication 
systea. [l]-[3]  There  are  three  free -space  interconnection  Multistage 

networks  for  reconf igurable  optical  interconnection  network,  such  as 
Crossover,  Onega,  Banyan  network.  We  adopt  crossover  network  as 
reconf igurable  optical  interconnection  network  for  aultiprocessor.  The 
optical  systea  of  switching  network  presented  in  this  paper  coaprises  of 
optical  package  of  Log^N  , which  optical  systea  have  saae  hardware  coaponents 

except  different  period  of  priaatic  airror  arrays.  The  optical  package  of 
each  stage  consists  of  4-levels  phase  Fresnel  lenslet  arrays  (F)  ,  prisaatic 
grating  airror  arrays  (PG),  the  bean  splitters  (BS),  the  quarter  waveplates, 
self-electro-optic-effect  devices  of  electronic  addressing (SEED)  devices, 
and  testing  arrays  ,  as  show  in  Fig.l.  The  optical  systea  based  on  the 
principle  of  focal  plane  iaaging.  The  phase  Fresnel  lenslet  array  is  used  as 
the  generation  of  light  spots  array  which  split  one  colliaated  laser  bean 
with  hoaogeneous  distribution  in  soae  region  into  light  spot  array  on  the 
focal  plane.  It  is  also  used  as  iaaging  lenslet  array  to  iaaging  the  light 
spot  of  each  channel  on  each  aesa  of  SEED  and  testing  array  substrates. 

A  switching  network  coaprise  of  both  link  and  node.  A  optical  crossover 
interconnection  network  have  Log2(N)+l  node  stages  with  N/2  node  and  Log2N 

optical  link  stages  with  N  links.  The  optical  hardware  of  node  stages  is 
the  first  aodule  in  Fig.l,  which  includes  FjjBS^.SEED^and  SEED, .  A 

separated  (2,2,2)  node  type  of  electronic  address  is  presented  for  node 
switching  of  the  systea,  as  shown  in  Fig. 2.  In  optical  systea,  the  input 
iaage  is  separated  two  iaage  copes  by  BSj.  Each  channel  of  one  iaage  copy  is 

controlled  by  SEED^  of  electronic  address  to  yield  the  straight  connection. 

The  other  copy  is  controlled  by  SEED2  to  yield  crossed  connection.  The  SEED^ 

and  SEEDg  are  operated  at  the  noraal-off  state,  or  the  noraal-on  state  .  The 

connection  type  of  a  pair  of  input  and  output  ports  are  decided  by  electronic 
signals  applied  on  pixels  of  SEED^  and  SEED2.  The  optical  hardware  of  link 
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stage  is  iapleaented  by  the  second  nodule  in  Fig.l,  which  consists  of  F^i 
BS^)  N  and  PG.  The  straight  connection  is  iapleaented  by  the  route  of 
Fj-BSj-SESDj-BSj-Fg-BSg-M  -BSg-D.  The  crossed  connection  is  iapleaented  by 
the  route  of  F^-BS^-SElDg-BS^-Fg-BSg-PG-  BS^-D.  The  testing  signals  are  used 
as  0/E  converter  for  cascading  electronic  addressing  signals  of  SEED^  and 
SEEDj  in  the  next  stage. 

Based  on  the  architecture  detailed  above,  we  fabricated  an  optical 
package  of  one  stage  in  reconfigurable  free- space  optical  switching  network 
of  8x8.  All  channels  of  8x8  are  arranged  in  the  size  of  3.2  x  3.2  an  when 
the  space  between  adjacent  channels  is  0.4  ss.  But  the  size  of  optical 
package  body  is  35  x  30  x  15  ns0.  It  can  contain  32  x  32  data  channels  to 
travel  through.  Due  to  focal  length  of  phase  Fresnel  lens  can  be  aade  very 
snail,  the  design  of  optical  systen  is  very  coapact,  in  which  the  optical 
coaponents  in  the  first  and  the  second  nodules  is  fixed  by  optical  glue. The 
efficiency  of  4-levels  phase  Fresnel  lens  array  is  about  63  X.  The  insertion 
losses  of  optical  switch  devices  SEED  estinated  froa  the  aeasured,  the 
reflectivity  of  high  state  are  about  1.5  dB  .  The  switch  tine  of  SEEDs  are 
about  10  ns,  when  they  operate  in  noraal-on  state,  the  response  tine  of 
testing  array  is  about  2  ns.  Soae  basic  systea  operation  were  carried  out  on 
the  optical  package.  A  data  packet  which  includes  8  bit  of  data  is 
transaitted  through  one  optical  package  of  interconnection  network  froa  the 
input  port  to  the  output  ports.  The  four-functions  of  the  straight,  the 
exchange,  the  upper  broadcast  and  the  lower  broadcast  of  (2,2,2)  node 
switching  in  optical  interconnection  network  are  iapleaented  by  SEED 
devices.  The  output  signal  froa  testing  array  are  used  as  the  input  signal  of 
the  next  state  for  cascading  when  input  laser  incident  on  the  aesas  of  SEEDs. 
Since  SEED  device  is  the  space  light  aodulator  of  electronic  address,  so  the 
interconnection  state  is  controlled  by  electrical  Bignals  during  the  systea 
operation,  the  processors  can  use  aore  efficient.  Optical  package  is  very 
stable  and  reliab  and  is  very  easy  to  adjust.  The  optical  package  of 
reconfigurable  free-space  interconnection  switching  network  suited  for  very 
high  speed  operation  systea. 
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Abstract 

We  present  e  64-port,  fret-space  network  proto¬ 
type,  with  communication  to tea  of  greeter  then  1 
Gbit/ Sec  per  port.  Using  today’s  technology,  our 
network  scales  to  thousands  of  ports  and  is  useful 
for  massively  parallel  processing  architectures. 

1  Introduction 

The  interconnection  network  plays  an  impor¬ 
tant  role  in  the  overall  performance  of  parallel  pro¬ 
cessing  systems.  Optics  has  been  suggested  as  a 
potentially  superior  technology  over  electronic,  for 
interconnecting  thousands  of  processing  dements 
(PEs)  in  a  massively  parallel  processing  system 
(MPP).  In  an  MPP  system,  a  PE  may  be  built 
around  a  state-of-the-art,  off-the-shelf  processor 
(Digital  ‘Alpha”,  MIPS  R4400,  etc.).  To  serve 
the  interconnection  needs  of  today’s  and  near  fu¬ 
ture  processors  (“Intel  2000  CPU”),  that  are  being 
and  will  be  used  for  building  MPP  systems,  a  high 
performance  network  is  needed.  Such  processors, 
having  a  performance  rate  of  one  to  two  thousands 
MIPS  (million  instructions  per  second)  on  words  of 
information  that  are  04  to  128  bits  long,  generate 
a  total  throughput  of  128  to  258  Gbit/Sec.  Not  all 
of  this  traffic  needs  to  be  handled  by  the  network. 
A  fraction  of  this  (e.g.,  10  GBit/Sec  per  PE)  may 
cause  difficulties  in  large  systems  with  thousands 
of  PEs. 

2  The  Quest  for  the  Ideal  Network 

A  network  can  be  characterised  by  several  pa¬ 
rameters:  Number  of  ports;  bandwidth  capacity 
per  port;  the  network’s  structure;  latency  delays: 
from  one  port  to  another,  without  arbitration  (i.e., 
circuit  switching  or  reconfigurable  mode  of  opera¬ 
tion);  and  routing  and  switching  delays  (caused  by 
arbitration  and  other  traffic  specific  issues,  such  as 
blocking,  and  also  depends  on  the  network  struc¬ 
ture). 

It  is  generally  accepted  that  for  large  networks 
(thousands  of  ports),  optics  is  better  than  elec¬ 
tronics  in  providing  high  bandwidth  connections. 
However,  it  is  also  commonly  accepted  that  today, 
optics  cannot  compete  with  electronics  for  general 
purpose  processing  because  of  higher  cost  and  the 
immaturity  of  the  optics  technology. 


3  A  Reconfigurable  Free-Space  Network 
Our  proposed  optical  network  is  based  on  sev¬ 
eral  basic  principles  and  observations  on  the  needs 
and  possible  tradeoffs  of  an  optical  MPP  network: 

1)  “Logic-Less”  optical  operation:  Optical  tech¬ 
nology  is  used  for  point-to-point  high  bandwidth 
connectivity  and  not  to  do  dynamic  routing  or  ar¬ 
bitration. 

2)  Distributed  Layout  and  Control:  No  centralisa¬ 
tion  in  terms  of  one  common  element  (e.g.  a  lens) 
or  active  device  (e.g.,  one  array  of  VCSELs).  No 
central  arbitration  and  control  is  needed.  The  op¬ 
tical  network  scales  in  proportion  to  the  number 
of  PEs  (although  for  practical  reasons,  one  would 
choose  to  use  fewer,  but  larger  active  devices  such 
as  VCSELs  arrays). 

3)  Reconfiguration  is  not  needed  very  often  as 
many  parallel  application  exhibit  switching  local¬ 
ity:  i.e.,  a  PE  will  only  need  to  communicate  with 
a  small  number  of  other  PEs  for  a  period  of  time. 
Choosing  between  these  PEs  can  be  done  by  the 
electronic  switches  that  acts  as  “interconnection 
caches”  to  the  initial  reconfiguration  of  the  optical 
network. 

3.1  Folded  Clos  Network 


Figure  1:  A  folded  three  stage  Clos  network. 

Figure  1  presents  the  structure  of  our  network. 
The  middle  stage  is  the  optical  network,  made 
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at  logic- less*  twitches  (reconfigurable  or  circuit- 
switching  operation)  using  VCSELt  array a.  The 
fust  and  third  stages  (folded  into  one)  are  made  of 
electronic,  small  crossbar  switches. 

Such  a  network  can  be  used  to  map  parallel  ap¬ 
plications  that  have  switching  locality.  We  have  in¬ 
vestigated  the  mapping  issue  previously  and  came 
to  a  conclusion  that  many  classical  communication 
patterns  (such  as  trees,  2-D  and  3-D  mashes,  hy¬ 
percubes,  pyramids,  mesh  of  trees  etc.)  as  well  as 
Don-symmetrical  communication  patterns  can  be 
well  embedded  in  our  structure. 

3.2  Mapping  rad  Embedding 

Figure  2  is  an  example  of  the  embedding  of 
a  tree  into  our  network.  Note  that  the  cluster¬ 
ing  means  that  nodes  in  the  same  cluster  will  be 
mapped  to  boards  that  share  a  common  small  elec¬ 
tronic  crossbar  switch.  The  connections  between 
clusters  are  made  by  reconfiguration  of  the  optical 
middle  layer  of  the  Clos  network. 


Figure  2:  Embedding  a  binary  tree  into  folded 
Clos. 

4  System  Overview 

Figure  3  represents  the  overall  system  view. 
PEs  are  groups  into  boards,  each  having  a  fast 
electronic  crossbar  switch.  Boards  are  arranged 
into  columns  that  are  connected  using  optical  free- 
space  multiple  buses.  Each  board  can  be  connected 
to  few  other  baords  (8  to  16).  The  selection  of 
these  boards  is  done  by  reconfiguration  of  the  op¬ 
tical  interconnections .  This  mode  of  operation  is 
useful  in  many  parallel  applications  that  exhibit 
switching  locality  or/and  have  “phases”  of  compu¬ 
tation  with  specific  interconnection  topologies. 

We  used  VCSELs  arrays  that  are  arranged  as 
8x8  devices  with  access  to  each  one  of  the  VC- 
SELs  individually.  Such  an  access  is  not  needed 
for  our  architecture,  but  these  devices  where  avail¬ 
able  off-the-shelf.  By  having  access  to  individual 
VCSELs,  we  could  used  only  four  8x8  devices 
that  are  subdivided  into  16  separate  tones  each 
having  2x2  devices.  Tinas  each  PE  can  reconfig¬ 
ure  its  subdivision  to  connect  to  one  of  the  other 
3  boards. 

5  Scalability  rad  Future  Issues 

Future  systems  may  have  larger  numbers  of 
PEs.  One  critical  element  is  the  site  and  the  elec- 


Figure  3:  System  View  of  the  Optical  Network. 


trical  connections  for  the  VCSELs  arrays  to  be 
used.  Our  prototype  is  an  example  of  the  foreseen 
maximum  needed  (i.e.,  16  connections  per  VCSELs 
array).  We  anticipate  fewer  electrical  connections 
in  future  VCSELS  arrays.  Such  devices  can  be 
made  as  an  X-Y  matrix  or  by  embedding  an  “11- 
Tree”  structure  to  control  each  sub-division  on  the 
array  for  each  PE.  The  “leaves”  of  this  H-Tree 
structure  end  with  individual  VCSELs  that  are  se¬ 
lected  to  illuminate.  By  selecting  a  certain  VC- 
SEL,  the  connection  to  that  corresponding  Board, 
can  be  made  by  the  optical  layer  of  the  Clos  net¬ 
work.  The  number  of  high  bandwidth  connections 
that  need  to  be  made  to  the  package  of  such  an 
array,  will  be  limited  to  at  most  16.  In  addition, 
low  speed  connections  used  for  choosing  the  ap¬ 
propriate  VCSEL  in  each  sub-division  are  needed. 
These  are  low  speed  connections  since  the  reconfig¬ 
uration  is  expected  not  to  be  done  very  often.  One 
important  issue  to  address  is  that  of  the  power 
budget.  Currently,  we  have  employed  “variable 
efficiency’1  holographic  elements  to  maximise  the 
power  available.  In  addition  to  this,  it  may  be  nec¬ 
essary  to  add  amplification  or  regeneration  of  the 
optical  data,  in  certain  location,  for  a  high  band¬ 
width  and  low  noise  communication .  We  plan  to 
look  into  such  problems  in  the  future.  We  will 
attempt  to  build  part  of  a  larger  network  for  con¬ 
necting  thousands  of  PEs  and  with  bandwidth  of 
10  GBit/Sec  per  channel. 

6  Summary 

We  have  presented  the  system  structure  of  a 
prototype  for  an  optical  interconnection  network  to 
be  used  for  MPP  architectures.  Our  system  bene¬ 
fits  from  the  high  connectivity  and  throughput  op¬ 
tics  can  offer  while  limiting  the  processing  needed 
for  routing  to  electronic  technology.  Many  parallel 
applications  can  use  such  a  reconfigurable  network 
as  they  naturally  exhibit  switching  locality  in  their 
communication  needs.  We  hope  to  extend  our  sys¬ 
tem  in  the  future  to  accommodate  thousands  of 
PEs  in  a  multi-Gigabit/Sec  channels. 
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Abstract 

Reconfigurable  optical  interconnections  by  using  double  phase  conjugate  mirror  in 
photorcfractive  Bi^HO*,  crystal  arc  demonstrated.  Phase  conjugate  wave  reflectivity  up  to 
7%  and  with  a  response  time  of  5  seconds  upon  pump  intensity  1  mW/mm2  was  obtained  at 
X  =  632.8  nm. 

Summary 

Photorcfractive  ciystals  (PRC)  have  beat  proved  to  be  significant  in  parallel  optical 
information  processing  owing  to  their  real-time,  high  density,  recyclable  volume  holographic 
recording  and  phase  conjugation.  One  of  the  most  promising  and  interesting  effects  recently 
discovered  in  PRC  is  double  phase  conjugation  of  mutually  incoherent  pump  beams.  For  the 
PRC  illuminated  on  opposite  faces  by  two  pumps,  two  independent  counterpropagating  beams 
build  up,  originating  from  the  beam  fanning.  They  give  rise  to  a  common  grating,  which  is 
reinforced  by  positive  feedback  and  finally  results  in  die  presence  of  a  pair  of  phase  conjugate 
beams  through  a  cross-readout  process1.  This  double  phase  conjugate  mirror  (DPCM)  can  be 
used  as  reconfigurable  interconnections  between  two  sets  of  fibers2.  As  an  example, 
interconnection  between  one  fiber  and  a  set  of  three  fibers  is  shown  on  the  Fig.l.  Light 
emitted  from  fire  fiber  (Input  1)  is  focusing  by  the  lens  1  on  the  PRC’s  face.  At  the  same 
time,  the  light  emitted  from  the  set  of  fibers  (Inputs  2-4)  is  focusing  by  the  lens  2  on  the 
opposite  side.  After  a  response  time  x,  common  holographic  grating  is  recorded  in  the  PRC, 
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producing  a  pair  of  phase  conjugate  beams,  which  are  automatically  introduced  into  the  fiber 
from  one  side  and  into  the  set  of  fibers  from  the  other  side.  Phase  conjugate  beams  can  be 
extracted  by  using  beamsplitters  BS.  Therefore,  die  signal  from  the  Input  1  achieves  all  the 
Receivers  R2-R4  and  the  sum  of  die  signals  from  the  Inputs  2-4  achieves  Receiver  Rl.  In  this 
example  die  Input  1  is  interconnected  with  all  three  Inputs  2-4.  Interconnections  can  be 
simply  changed  by  introduction  of  light  only  into  two  fibers  3  and  4,  for  example.  In  that  case 
after  the  response  time  t,  a  new  common  holographic  grating  will  be  recorded  in  the  PRC 
and  new  interconnections  between  Input  1  and  Inputs  3,4  will  be  established. 

Experiments  were  carried  out  on  a  fiber-like  Bi^TiO^  (BTO)  samples.  This  crystal 
belongs  to  the  same  structural  class  (sillenite)  as  Bi^SiO^.  However,  BTO  has  larger  electro¬ 
optic  coefficient  and  lower  optical  activity  compared  with  other  sillenites,  and  is  more 
sensitive  compared  with  BaTiOj  and  SBN  photorefractive  crystals.  The  fibers  were  cut  out 
from  the  bulk  crystal  along  the  [110]  crystallographic  axis  and  were  glued  between  two 
electrodes3.  Their  length  is  8-18  mm,  and  they  have  rectangular  cross-section  with  a  size  of 
approximately  1  mm.  A  bipolar  alternating  electric  field  of  square-wave  form  was  applied  to 
the  fiber  to  improve  its  enhancement  coefficient.  Four  longer  faces  of  the  fiber  were  optically 
polished,  but  end-faces  were  slightly  grounded  to  produce  a  scattering  of  transmitted  beam 
into  the  angle  of  approximately  4  degrees.  It  was  especially  made  to  introduce  pump  beams 
of  complex  (speckle-like)  wavefront  into  the  fiber,  that  allowed  us  to  avoid  conical 
degeneracies  in  the  Bragg  matching  angle  on  a  given  volume  grating4.  Two  independent 
linearly  polarized  helium-neon  lasers  (X  =  632.8  nm)  were  used  to  pump  the  BTO-fiber.  The 
input  polarization  angles  were  chosen  experimentally  to  produce  as  high  phase  conjugate 
reflectivity  as  possible.  The  fiber  axis  was  tilted  in  respect  to  the  pump  beams  (as  shown  on 
the  Fig.l)  providing  at  least  one  internal  reflection  for  any  pump  from  fiber  faces. 

Upon  these  experimental  circumstances,  the  efficiency  of  transferring  the  pump  beam 
into  the  phase  conjugate  beam  up  to  7%  has  been  measured  in  our  fiber.  Note  that  the 
refractive  index  of  BTO  crystal  is  rather  high  (Oq  =  2.58  at  X  =  632.8  nm)  introducing  big 
losses  of  light  because  of  Fresnel  reflections.  Hence,  the  end-faces  coating  of  the  fiber  by 
antireflection  layers  could  increase  the  PC  reflectivity  at  least  twice.  The  response  time  t  is 
inversely  proportional  to  the  intensity  of  pump  beams,  and  was  measured  to  be  5  seconds  for 
pump  beam  intensity  of  1  mW/mm2.  We  succeeded  to  obtain  phase  conjugate  beams  of  com¬ 
plex  images  with  spatial  bandwidth  up  to  50  lp/mm. 

Single  crystal  photorefractive  fibers  have  several  advantages  compared  to  the  bulk 
crystals.  They  can  be  assembled  in  a  variety  of  configurations,  isolating  each  stack  of 
interconnections  from  other  stacks  in  adjacent  fibers.  This  can  provide  realization  of  great 
amount  of  interconnected  fibers  with  a  possibility  of  independent  reconfiguration  of  different 
groups  of  them.  Moreover,  photorefractive  fibers  are  easier  to  grow  by  using  the  Laser  Heated 
Pedestal  Growth  Technique,  that  possibly  diminishes  the  price  of  DPCM  elements. 

References: 

1.  P.  Yeh,  T.  Y.  Chang,  and  M.  D.  Ewbank,  J.  Opt  Soc.  Am.  B,  5,  1743  (1988). 

2.  S.  Weiss,  M.  Segev,  S.  Stemklar,  and  B.  Fischer,  Appl.  Opt,  27,  3422  (1988). 

3.  A.A.  Kamshilin,  R.  Ravattinen,  H.Tuovinen,  T.  Jaaskelainen,  and  V.V.  Prokofiev, 
Optics  Comm.  103,  221  (1993). 

M.  P.  Petrov,  S.  L.  Sochava,  and  S.  L  Stepanov,  Opt  Lett  14,  284  (1989). 


4. 


WP8/245 


A  compact  holographically  routed  optical  crossbar  using  a  ferroelectric  liquid-crystal 

over  silicon  spatial  light  modulator. 

D.C. O’Brien,  Douglas.  J.  Mcknight,  Optoelectronic  Computing  and  Systems  center,  Campus  Bax  525, 
University  of  Colorado,  Boulder,  CO  80309.,  USA.  (tel  303  492  3330) 

Abstract 

A  holographically  routed  crossbar  interconnect  is  currently  under  development,  using  a  ferroelectric 
liquid  crystal  over  silicon  spatial  light  modulator  as  a  programmable  hologram.  Results  of  simulation 
and  experiment  are  presented  and  the  potential  of  the  technique  discussed. 

INTRODUCTION 

Reconfigurable  free  space  interconnects  have  wide  application  in  optical  computing  and  information 
processing,  and  crossbars  are  a  well  known  means  to  implement  them:  All  inputs  are  connected  to 
all  outputs  and  unwanted  routes  are  blocked.  For  an  N  channel  switch  this  approach  has  an  intrinsic 
fan-out  loss.  A  programmable  hologram  can  be  used  to  replace  the  fan-out  optics  and  blocking 
component,  creating  a  dynamic  fan-out  element  to  steer  the  light  to  the  desired  output  port  [1].  This  has 
the  potential  to  remove  the  fan-out  loss  in  shuttered  geometries,  and  create  interconnects  with  analogue 
weights  using  a  binary  device.  Ferroelectric  Liquid  Crystal  spatial  light  modulators  (FELC  SLMs)  have 
a  robust  binary  phase  modulation  scheme  [2],  so  that  a  binary  phase  grating  can  be  written  to  the  SLM 
to  create  the  dynamic  fan-out  element.  In  this  paper  we  present  design  calculations  for  a  16  input  16 
output  holographically  routed  crossbar  interconnect,  and  preliminary  results  for  individual  component 
performance.  Full  results  for  the  system  will  be  presented  at  the  conference. 

OVERVIEW 

Figure  1  shows  a  schematic  of  the  crossbar  geometry.  The  16  channels  are  arranged  as  a  4x4  matrix, as 
this  makes  lull  use  of  the  aperture  of  the  optical  system.  The  SLM  is  divided  into  16  regions,  each  of 
which  is  illuminated  by  one  of  the  16  sources.  Each  region  displays  a  particular  base  hologram,  designed 
to  steer  the  illuminating  beam  to  the  correct  output  port.  There  are  16  base  holograms,  each  routing  to 
a  particular  output  port:  to  route  from  port  1  to  port  16  the  correct  base  hologram  is  placed  in  front  of 
source  1. 

We  are  considering  several  different  sources  for  the  crossbar:  The  preferred  option  is  a  vertical  cavity 


Figure  1:  Schematic  of  holographic  crossbar 


surface  emitting  laser  array  emitting  at  840nm  ,  although  control  of  polarisation  across  the  array  appears 
difficult. 

The  holograms  are  replayed  on  a  256x256  liquid-crystal  over  silicon  spatial  light  modulator  (SLM) 
[3]  with  a  21.6pm  pixel  pitch.  Each  hologram  is  64x64  pixels  and  is  created  using  a  simulated  annealing 
technique  [1].  They  are  designed  to  route  to  a  2x8  Silicon  photodiode  array  which  forms  the  output  of 
the  crossbar. 

The  output  plane  is  a  2x8  array  of  Silicon  photodiodes  fabricated  in  a  2pm  n  well  CMOS  process. 
These  have  a  200pm  square  active  area  on  a  250pm  pitch.  The  circuits  have  on  board  thresholding  and 
CMOS  output  levels. 

SIMULATION 

The  crossbar  fails  when  the  power  received  at  a  detector  which  is  supposed  to  be  off  is  greater  than  the 
lowest  power  level  which  should  be  detected  as  a  one  level  (the  detector  threshold).  In  a  holographic 
crossbar  this  is  mainly  caused  by  the  holograms  ‘scatttering’  light,  which  creates  a  uniform  noise  level 


WP8/246 


Ptot  of  simulated  «ad  rocanscd  spot  intensities  Contour  plot  of  replay  of  all  routing  holograms 
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Figure  2:  Performance  of  beamsteering  holograms 


over  the  detector  plane.  As  more  holograms  are  illuminated  this  level  can  build  up  and  cause  the  switch 
to  fail.  In  operation  the  outputs  can  either  be  illuminated  or  dark,  creating  a  finite  number  of  switch 
states,  which  can  be  simulated.  In  this  case  the  worst  case  simulated  crosstalk  (highest  off/ lowest  one) 
is  0.16,  showing  the  crossbar  should  function  with  good  noise  margin. 

MEASUREMENT 

Holograms  were  calculated  and  replayed  on  a  64x64  pixel  region  (corresponding  to  a  subhologram)  of  the 
SLM  to  test  the  performance  of  the  device.  A  collimated  beam  (A  =  633nm  passes  through  a  polariser  and 
illuminates  the  device. The  reflected  wave  is  Fourier  transformed  with  an  achromat  and  passes  through 
an  an alsyer.  Polariser  and  analyser  are  rotated  to  create  binary  (0  and  v)  phase  modulation.  A  grating 
is  created  by  setting  alternate  rows  of  the  SLM  to  opposite  states.  This  should  have  no  central  DC  spot 
in  replay  if  the  device  is  binary  phase,  with  two  adjacent  bright  spots.  The  measured  ratio  of  the  bright 
spots  to  the  DC  spot  is  33.0  ±  1.65.  This  compares  favourably  with  other  ferroelectric  SLMs  [1-2]  and 
indicates  the  SLM  is  operating  dose  to  binary  phase.  In  this  configuration  approximately  2%  of  the  power 
is  diffracted  into  each  peak  (excluding  external  losses),  compared  with  a  theoretical  value  of  40%.  It  is 
interesting  to  note  that  a  similar  shuttered  crossbar  would  have  an  intrinsic  transmission  of  6%  before 
real  losses  are  taken  into  account-  only  a  modest  improvement  in  device  performance  is  required  to  reach 
this  figure,  and  large  improvements  are  expected  for  a  device  with  a  liquid  crystal  layer  optimised  for 
t’  3,  wavelength. 

Each  of  the  beamsteering  holograms  was  displayed  chi  the  SLM  and  the  ratio  of  peak  intensity  to 
uniform  noise  intensity  measured.  Figure  2  shows  a  comparison  of  actual  and  simulated  performance, 
and  a  contour  plot  of  the  replay  of  all  the  base  holograms.  The  variation  in  simulated  peak  heights  is  due 
to  the  envelope  caused  by  the  pixel  transmission  function,  which  should  be  sine4  if  each  pixel  transmits 
uniformly  across  its  area.  In  reality  the  function  is  more  complex,  which  creates  measured  peaks  which 
are  actually  more  uniform  than  predicted.  The  ratio  of  the  spot  intensity  to  the  noise  level  in  the  holo¬ 
gram  was  measured  few  a  representative  hologram  and  was  37±10  dB.  Together  these  results  mean  that 
the  interconnect  should  function  with  a  high  noise  margin. 

CONCLUSIONS 

Preliminary  results  indicate  high  quality  holograms  can  be  generated  with  the  SLM  device  and  that  the 
crossbar  should  function  with  good  noise  margin.  Overall  transmission  through  the  interconnect  can 
be  increased  beyond  an  equivalent  shadow  routed  crossbar  with  only  modest  improvements  over  these 
preliminary  results,  showing  the  potential  of  the  interconnect  and  SLM  device. 
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Abstract 

Two  approaches  to  optical  interconnect  using  pixellated  spatial  light  modulators  as  reconfigurable  diffrac¬ 
tive  optical  ahmssts  an  prsaantsd  use  analytical  using  a  low  resolution  SLM,  the  other  using 
annealing  and  a  state-of-the-art  silicon  backplane  device. 

Summary 

An  analytical  approach  shows  how  simple  switching  may  be  achieved  using  a  low  resolution  SLM. 
Previous  work 111  has  shown  that  the  delta-function-like  spikes  in  the  power  spectrum  of  a  pixellated  filter 
which  give  rise  to  the  replication  may  be  attenuated  or  removed  by  selection  of  a  specific  pixel  pitchanse 
ratio,  and  introduction  of  a  specific  pixel  position  randomisation  scheme  which  applies  a  displacement 
o f  the  pixel  centre  from  its  regular  position.  This  generalises  the  expression  for  the  power  spectrum  of 
the  Fourier  plane  filter  ((<(*,  y)!1)  with  all  pixels  'on'  to 

<|t(*.y)la>  =  l-P(*.v)la  x  Q>  [l  -  |p(*)|s|p(y)|a  +  « 

where  P(»,  y)  is  the  Fourier  transform  of  the  single  pixel  transmission  function,  p(x)  and  p(y)  are 
the  Fourier  transforms  of  the  pixel  position  probability  distribution  functions,  a  is  the  pitch  of  the 
underlying  regular  array,  and  the  array  consists  of  Q  X  Q  pixels.  The  spectral  orders  ate  placed  at 
(n/a,m/a),  n,m  =  0,±1,±2, ...  With  square  pixels  of  side  a/2,  P(«,  y)  =  ^-smcJ($*,  $ y)  which  has 
serosa  at  the  positions  of  all  the  even  numbered  spectral  orders  other  then  the  sero  order.  Further, 
choosing  p(s)  =  cos(^c)  and  p(y)  =  cosf^y),  we  find  that  the  allowed  displacements  of  the  pixel 
from  the  regular  position  are  such  that  each  pixel  can  take  one  of  four  allowed  positions  in  an  a  x  a 
square,  (figure  1)  and  moot  importantly  the  odd  numbered  spectral  orders  in  both  *  and  y  directions 
are  eliminated  by  seroes  of  p(c)  and  p(y).  If  however  we  retain  periodicity  in  either  the  *— direction 
(p(c)  =  1)  and/or  the  y- direction  (p(y)  =  1),  the  first  orders  on  the  respective  axes  will  remain.  The 
power  in  the  attenuated  orders  is  redistributed  into  a  diffuse  background  ~  1  /Q1  times  the  sero-order 
intensity  (~  QA/Q1  times  the  first  order  intensity). 

Thus  a  means  exists  whereby  information  can  be  routed  via  certain  of  the  first  order  replicas,  depen¬ 
dent  on  the  distribution  of  the  transmitting  subpixel  in  each  cell  of  the  array.  Figure  2  helps  illustrate 
this. 
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Figure  1:  a)  Possible  pixel  positions  within  ax  a  cell,  b)  A  resultant  16x16  array  of  pixels 

Using  a  16x16  SLM  a  2x2  block  of  pixels  could  be  used  to  code  each  of  the  a  x  a  cells,  giving  an  8x8 
array  of  cells  with  "n*  pixel  'on*  in  cell.  The  SLM  would  be  capable  of  routing  8x8  pixel  arrays  when 

used  in  the  Fourier  plans  of  a  4-/  qptical  processor,  offering  the  benefits  of  parallelism  and  a  modest 
degree  of  image  fanout.  The  ‘on’  first  order  replicas  have  the  same  intensity  regardless  of  whether  there 
are  sero,  two  or  four  of  them  (to  a  good  approximation) :  this  would  not  be  the  case  if  the  routing  were 
achieved  simply  by  writing  stripe  patterns  on  the  SLM  to  approximate  diffraction  gratings. 
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Figure  2:  Selective  removal  of  sharing  to  provide  selective  fanout/routing 

Here  useful  adaptive  optical  interconnection  ma y  be  achieved  using  binary  phase  Computer  Gener¬ 
ated  (CGHs)  displayed  on  SLMa.  A  very  attractive  generic  type  of  SLM  is  the  Ferrodectric 

Liquid  Crystal  over  Very  Latte  Seals  Integration  (FLC/VLSI)  SLM*.  This  device  has  a  FLC  cell  fab¬ 
ricated  on  top  of  a  custom  designed  VLSI  ribcon  backplane.  The  backplane  contains  an  array  of  pixd 
memory  dements,  pixd  mirrors  and  addressing  circuitry.  The  controllable  pixels  in  the  SLM  modulate 
the  relative  phase  by  exactly  0  or  v  due  to  the  switch  able  nature  of  the  FLC  structure. 


Figure  3:  An  8  x  8  holographic  interconnect  from  the  SLM 

Figure  3  shows  an  8x8  array  of  spots  generated  using  a  CGH  displayed  on  a  176 x  170  FLC/VLSI 
SLM.  The  d.c.  spot  is  very  bright  das  to  the  low  pixd  fill  factor,  inherent  in  backplanes  fabricated 
using  commercial  silicon  processing  with  the  microcircuitry  located  around  the  minors.  The  minors 
are  fabricated,  at  present,  using  a  standard  VLSI  aluminum  metal  layer.  This  layer  is  deposited  with 
electrical  conductivity,  rather  than  optical  flatness,  as  the  main  criterion.  The  minors  are  not  optically 
flat  and  hence  scatter  light  to  give  noiee  in  the  reconstruction.  The  rough  surface  profiles  of  the  minors 
also  prevent  uniform  high  quality  alignment  of  the  FLC  over  the  array. 

Planarisatioo  techniques  may  be  used  to  overcome  the  problems  associated  with  the  VLSI  fabricated 
circuitry  and  minors  by  burying  them  beneath  a  polished  dielectric  film,  depositing  high  quality  metd 
minora  on  the  surface  and  providing  smell  interconnection  vies  through  the  dielectric  layer  between  the 
original  and  new  metal  layers4.  This  processing  is  expected  to  increase  the  reflection  efficiency  of  the 
devices  by  about  a  factor  of  2  and  increase  the  fill  factor  to  nearly  100%. 

The  improvements  to  the  SLM  performance  will  be  presented,  and  comparisons  of  experimental  and 
theoretical  results  for  the  high  fill  factor  SLM,  performing  holographic  interconnection  will  be  discussed. 
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At  presort,  reconfigurable  interconnection  networks  realizing  data  exchange  between  input 
and  output  ports  are  a  bottle-neck  in  high-performance  computers  and  telecommunication 
systems.  The  throughput  of  such  switching  systems  (SSs)  depends  primarily  on  data  capacity 
W  of  connected  pairs  of  input-output  channels  and  the  reconfiguration  time  t,.  An  advantage  in 
principle  of  optical  techniques  over  electrical  ones  offers  strong  possibilities  to  increase  W  by 
data  transmitting  through  connected  pair  of  channels  in  form  of  2-D  parallel  binary  digital  code 
patterns  with  no  changing  of  t,..  However,  the  specific  peculiarities  of  optics  demand 
architectural  principle  for  photonic  SS  different  from  electrical  one. 

The  architecture  of  a  large  size  photonic  SS,  first,  must  make  sure  for  2-D  flat  image 
transmission  through  each  pair  input-output  ports  for  any  interconnect  pattern,  second,  have 
great  enough  numerical  aperture  of  connected  optical  channels,  third,  provide  a  very  low 
crosstalk.  In  this  report  architectural  decisions  that  satisfy  these  conditions  are  proposed  for 
bulk  multistage  photonic  SS  of  size  2N2x2N2,  where  N=2r,  r  =  1,2,3,  ....  ,  consisted  of 
cascaded  along  K  switching  stages  (polarization  switching  spatial  light  modulators)  and  K-l 
routing  stages,  which  are  combined  by  straight  optical  connections. 

The  three  types  of  the  polarization-based  building  blocks  with  M  inlets  and  M  outlets,  that 
can  be  exploited  for  designing  of  FPxN2  routing  stages,  are  presented  in  Fig.  1 .  If  the  prism 
angle  a  satisfies  the  relation  cosa  =  2-0  J  (l+l/8/i2)0-5  +l/4n,  where  n  is  the  refraction  index  of 
PBS,  inputs  and  outputs  with  the  same  name  are  placed  on  the  optical  axis  that  passes  through 
them.  In  the  routing  setup  C  the  HWPs  depicted  by  dashed  lines  can  be  absent  if  the  output 
PBSs  are  turned  by  90°  about  the  horizontal  axis  [1],  The  setup  of  each  routing  block 
guarantees  equal  optical  path  lengths  of  links  between  input  and  output  ports  and  allows  to 
apply  optical  means  with  the  ultimate  numerical  aperture  u  -  nsinal(  1  +cosa)M,  nIM  and 
«/4M  for  blocks  A,  B  and  C,  respectively.  The  value  of  u  for  block  C  can  be  4-fold  increased  if 
lens  arrays  are  placed  not  only  at  inputs  and  outputs  but  also  before  QWPs  and  HWPs,  as  is 
done  in  [2]. 

In  the  blocks  A  and  B  polarized  beams  realize  butterfly  connections  between  inlets  and 
outlets,  s-polarized  beams  realize  crossover  connection  between  the  ports  1,  2,  ...  M/2  and 
l+M/2,  2+M/2, ...  M.  In  the  block  C  the p-polarized  beams  connect  ports  with  the  same  name 
and  the  s-polarized  beams  implement  crossover  communication  links  connecting  the  inputs  1, 
2,  ...  M/2  and  l+M/2,  2+M/2,  ...  M  with  the  outputs  M,  ...  2+M/2,  l+M/2  and  1/M,  ...  2,  1, 
respectively. 
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Three  possible  variations  of  2-D  nonblocking  networks  L,  n  [3]  and  in  for  an  architecture 
of  photonic  SSs  with  straight  optical  interstage  connections  are  shown  in  Fig.  2.  It  is 
demonstrated  in  the  report  how  time  networks  and  horizontal  and  vertical  routing  blocks  A,  B 
or  C  make  possible  the  efficient  interconnect  architecture  for  2N7x2N2  three-dimensional 
compact  high-throughput  photonic  nonblocking  switching  systems  with  many  ports  and  the 
number  of  switching  stages  K  *  2/ogN+l,  2N+1  or  4/ogN-l  The  main  characteristics  of  such 
SSs  are  evaluated. 

1.  Fyodorov  V.B.,  Optical  Computing  &  Processing,  vol.  3,  1993,  no.  1. 

2.  Jahns  J.J.,  and  Murdocca  M.J.,  Applied  Optics,  vol.  27,  1988,  no.  15. 

3.  Mirsalehi  M.M,  Shamir  J.,  and  Caulfild  H.J.,  Applied  Optics,  vol.  28,  1989,  no.  12. 


Fig.  1 .  Three  types  of  optical  routing  blocks, 
implemented  on  the  base  of  PBS  prism  A,  PBS 
cube  B  and  four  PBS  cubes  C.  PBS  -  polarizing 
beam  splitter,  HWP  and  QWP  -  half-  and 
quarter-wave  plates,  MIR  -  mirror. 
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Performance  Analysts  of 
Multistage  Interconnection  Networks 

Igor  G.  Voitenko  and  Toyohiko  Yatagai* 

Iastitute  of  Applied  Optics,  Byelorussian  Academy  of  Sciences 
212793  Mogilev,  Belarus 

'Institute  of  Applied  Physics,  University  of  Tsukuba 
Tsukuba, Ibaraki  30S,  Japan 
Tel:  +81-298-S3-5334 
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Abstract 

A  holographic  interconnection  technique  in  a  photorefractive 
waveguide  formed  by  titanium  and  ion  diffusion  in  LiNb03  is 
described.  A  banyan  network  using  this  is  designed.  An 
integrated  acoust  optic  switch  control  technique  is  also 
discussed. 

Problems  of  clock  distribution  and  data  communication  inside 
VLSI  chips  are  solved  by  using  optical  interconnections  with  optical 
fibers  and  holographic  techniques.  Volume  holograms  in 
waveguides  offer  a  straightforward  means  of  interfasing 
dynamically  reconfigurable  interconnections  with  integrated 
optoelectronic  devices.  In  this  papci  we  consider  holographic 
interconnections  between  nodes  and  stages  for  some  type  of  the 
optical  interconnection  networks  and  represent  experimental 
results  foT  holographic  interconnections  in  photorefractive 
waveguides  formed  by  titanium  and  iron  indiffusion  in  UNb03-  We 
have  recorded  photorefractive  holograms  in  single-mode  substrates 
titanium  and  iron  indiffused  waveguides  on  nominally  pure  y-cut 
LiNbOa  substrates.  We  define  the  z-axis  to  the  principal  axis  of 
propagation  in  the  waveguide.  Holograms  were  recorded  from 
above  the  waveguide  using  light  at  S14  nm  and  the  guided  TEmode 
at  633  nm  was  coupled  into  the  waveguide  using  a  rutile  and 

T.iVhO^  nrism  xnH  th^n  tn  r*rr*nstriirt  th*  hnlnarsimc 
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We  analyse  the  possible  optical  interpretations  of  the  traditional 
networks,  the  optical  interconnections  between  switching  elements, 
fault  detection  problems  of  this  networks  and  provide  a  new 
method  to  understand  and  prove  the  universality  optical  methods 
for  applicatioii  in  computer  systems.  We  demonstrate  first  that  a 
bitonic  sequence  can  be  routed  properly  by  a  banyan  network 
using  integrated-optic  holographic  interconnections.  First,  the 
architecture  and  the  working  principle  of  the  integrated  acousto¬ 
optic  space  switch  control,  functional  and  topological  equivalence 
this  networks  on  basis  of  the  optical  interconnections  are  described. 


nonwaveguide  bean 
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Fig.  Example  of  Holographic  Interconnection  in  Photorefractivc 
Waveguide. 
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Diagnostic  tectaiques  for  investigating  free-space  photonic  switching  systems 

R.  L.  Morrison,  F.  B.  McCormick,  T.  J.  Cloonan,  AX.  Lentine,  R.  A.  Novotny, 

J.  M.  Sasun,  M.  G.  Beckman,  M.  J.  Wojcik,  F.  A.  P.  Tooky*,  E  Kerbs  and  S.  J.  Hinteriong 
AT&T  Bell  Laboratories,  Naperville,  IL  60566 
*Heriot-Watt  University,  Edinburgh,  Scotland,  U.K. 

Over  the  past  few  years,  we  have  designed  and  assembled  five  bee-space  switching  photonic  system 
demonstrators1-2,3.  Our  goal  is  to  investigate  and  develop  the  technology  of  bee-space  digital  optics  as  a  framework 
for  exploring  high  throughput  interconnection  architectures.  Since  the  status  of  this  evolving  technology  is  often 
contrasted  against  the  capabilities  of  a  more  mature  electronic  field,  it  is  important  to  understand  the  operation  of  the 
demonstrators  in  order  to  promote  advanced  features  in  each  new  prototype.  In  addition,  the  feetflnek  obtained 
provides  a  foundation  far  gauging  the  future  potential  of  the  technology. 

The  integration  of  novel  opto-mechanical  frameworks,  diffractive  optics,  opto-electronic  device  arrays,  and  high 
power  semiconductor  lasers  requires  the  development  of  a  unique  array  of  diagnostics  tools  that  are  not  typically 
commercially  available.  In  this  talk,  we  describe  a  number  of  tools  that  we  have  implemented  for  characterizing 
components  and  systems.  These  tools  have  proven  essential  in  the  development  of  these  complex  demonstrators. 

The  process  of  designing  diagnostic  hardware  for  research  systems  ranges  from  careful  and  deliberate  planning, 
where  the  tool  is  craftily  integrated  into  the  system,  to  the  quick  assembly  of  equipment  to  investigate  unforeseen 
effects.  Typically,  the  first  instance  of  a  diagnostic  tool  relies  on  the  user  to  diligently  align  the  opto-mechanic 
framework  and  measurement  apparatus,  and  then  manually  record  and  analyze  the  measurements.  As  ow  knowledge 
of  the  critical  issues  increases,  diagnostic  tools  have  been  directly  integrated  into  the  system  during  the  design  phase. 
Computer  applications  are  also  increasingly  important  for  automating  complex  measurements,  monitoring  long  term 
stability  tests,  and  analyzing  experimental  results.  Our  experience  ranges  from  designing  custom  applications  for 
exercising  the  system  fabric  to  using  commercial  software  applications  for  coordinating  test  equipment 

Component  diagnostic  tools 

Beam  array  generation  holograms:  Phase  gratings  have  been  fabricated  that  generate  from  two  to  over  two 
thousand  uniform  intensity  spots.  Intensity  variations  are  primarily  due  to  limitations  of  the  fabrication  process.  A 
computer  controlled  microstage  aligned  pinhole  and  detector  is  used  to  characterize  smaller  arrays.  For  huger  arrays, 
a  custom  CCD  imaging  system  coordinated  with  image  processing  software  characterizes  the  arrays  (see  figure  1.) 
This  tool  has  also  been  used  to  measure  optical  signal  uniformity  throughout  the  system. 

Laser  optical  power  supplies:  The  current  demonstration  system  employs  six  high  power,  current  modulated, 
frequency  stabilized,  semiconductor  lasers  to  provide  optical  power  beams  at  the  device  modulator  windows.  It  is 
crucial  that  lasers  be  tuned  to  within  0  J  nm  of  the  850  ran  operating  point  and  that  they  maintain  this  wavelength  at  a 
variety  of  modulation  speeds.  Characterizing  the  lasers  requires  standard  testing  systems  such  as  monochromators, 
optical  power  meters,  beam  profilers,  and  high  speed  photodetectors.  A  Lab  View™  application  was  used  to 
coordinate  several  of  the  tests. 

Device  arrays:  An  automated  multiple  quantum  well  modulator  characterization  setup  measures  modulator 
reflectivity  and  photocurrent  as  a  function  of  voltage,  wavelength,  spot  position  and  optical  power.  In  addition,  each 
smart  pixel  array  is  tested  to  assure  that  the  cells  operate  at  the  design  speed. 

Polarization  beam  splitters:  Polarization  beam  splitters  are  used  in  the  system  to  combine  beam  arrays.  They  must 
operate  uniformly  over  an  angular  range  of  several  degrees,  a  more  stringent  requirement  than  is  typical  for  beam 
splitters.  A  software  application  was  used  to  control  the  diagnostic  equipment  and  then  analyze  the  data. 

Optical  components:  TVyman-Green  and  radial  shearing  interferometers  were  built  to  operate  at  850nm  to  examine 
the  quality  of  optical  components.  A  commercial  fringe  analysis  application  was  used  to  analyze  the  results. 

System  diagnostic  tods 

Fabric  control:  The  current  system2  is  a  five  stage  banyan  interconnection  network  connecting  32  inputs  with  16 
outputs.  A  total  of  512  path  combinations  exist.  Control  information  for  establishing  the  data  routes  must  be  optically 
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injected  into  the  system  and  latched  within  each  smart  pixel  at  the  appropriate  time,  at  rates  of  up  to  155Mbits  per 
second.  A  computer  baaed  control  system  is  used  for  both  initializing  the  system  and  presenting  a  nonthreatening  user 
interface.  A  DOS  based  PC  with  a  custom  written  C  program  was  designed. 

Mnttichannal  annlyris:  Cunendy,  proper  operation  is  assured  during  assembly  by  examining  optical  channels  at 
every  stage  by  aligning  an  avalanche  photodetector  with  the  magnified  image  of  the  data  modulated  spot  array. 

Fabric  analyser:  A  program  that  controls  a  mechanical  fiber  switch,  a  sampling  oscilloscope,  and  the  system  fabric 
was  developed  to  scan  each  of  the  S12  possible  paths  (see  figure  2.)  A  secondary  application  analyzes  the  data  to 
determine  if  the  data  matches  the  expected  bit  value  sequence  and  then  identifies  faulty  nodes. 

Stability:  Commercial  applications  of  free-space  digital  optical  technology  will  require  long  term  mechanical 
stability.  A  software  application  was  developed  to  monitor  positioning,  ambient  and  system  temperature,  detector 
photo-currents,  laser  wavelength  and  optical  power  fluctuations. 

Alignment:  The  current  smart  pixel  array  is  approximately  1000  pm  square  and  the  spot  arrays  must  be  aligned  to 
within  about  1  Jim.  A  high  magnification  viewport  was  designed  that  is  nonintrusive  (ix„  it  does  not  adversely  affect 
normal  operation  or  require  repositioning  of  components),  removable  (since  light  sampled  for  diagnostics  is  stolen 
from  normal  operation),  and  include  an  auxiliary  light  source  for  illuminating  the  device  array. 

In  the  future,  we  will  also  concentrate  on  further  development  of  bit  error  rate  characterizations,  long-term  stability 
and  lifetime  measurements,  and  a  two-dimensional  multi-channel  optical  sampling  oscilloscope. 

In  this  paper,  we  have  discussed  diagnostic  tools  developed  by  this  group.  We  would  also  like  to  acknowledge  the 
extensive  device  testing  effort  of  our  colleagues  at  Bell  Laboratories  facilities  in  Holmdel.  Murray  (fill  and  the  Solid 
State  Technology  center,  and  the  micro-optic  characterization  by  colleagues  at  the  Engineering  Research  Center. 
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Figure  1:  Intensity  distribution  of  a  64x32  spot  array. 


Figure  2:  Array  of  oscilloscope  scans  showing  a 
portion  of  the  512  paths. 
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A  cell-based  algorithm  for  designing  beam  array  generators 

Rick  L.  Morrison  and  Mike  J.  Wojdk 
AT&T  Bed  Laboratories 
Naperville,  IL  60566 

Current  free-qmce  digital  optical  systems  require  optical  power  supplies  that  generate  two  dimensional  arrays  of 
uniform  intensity  light  beams1.  One  method  for  creating  these  uniform  intensity  beam  arrays  is  to  illuminate  a 
computer  designed  Fourier  hologram  using  a  collimated,  high  power  laser  source.  These  surface  relief  gratings,  also 
referred  to  as  multiple  beamsplitters,  are  designed  using  scaler  diffraction  theory  via  a  computer  optimization 
process.  The  quality  of  the  hologram  is  quantified  by  its  diffraction  efficiency  and  the  relative  uniformity  of  the  beam 
intensities. 

We  present  a  new  algorithm  that  uses  a  pixelated,  discrete  phase  level  representation  of  the  hologram  to  examine  and 
modify  only  those  cells  that  form  the  boundary  between  phase  transitions,  thereby  substantially  reducing  processing 
requirements.  An  additional  feature  of  the  algorithm  is  the  capability  to  increase  the  spatial  bandwidth  of  the 
hologram  during  the  design  process.  Using  this  algorithm,  we  have  designed  holograms  with  up  to  4096x4096  phase 
cells  per  period  that  couple  light  into  arrays  containing  up  16,000  spots.  We  have  fabricated  a  number  of  smaller 
designs  for  use  in  our  frce-space  photonic  switching  program. 

The  algorithm  presented  here  is  similar  in  nature  to  the  IDO  (iterative  discrete  on-axis)  encoding  algorithm2.  In  the 
basic  IDO  algorithm,  one  period  of  the  two-dimensional  Fourier-plane  hologram  is  divided  into  an  array  of 
rectangular  cells  where  each  cell  imparts  one  of  two  fixed  phase  delays  to  the  incident  wavefront.  The  contribution  of 
each  cell  to  an  order  intensity  is  independent  of  the  other  cells,  thus,  each  lest  requires  calculation  of  only  a  small 
fraction  of  the  foil  set  of  diffraction  equations.  During  a  series  of  iterations,  each  cell  phase  is  reversed  and  this  new 
value  is  retained  based  on  its  contribution  to  the  merit  function  and  a  probabilistic  simulated  annealing  process. 

It  is  the  locations  of  the  level  transitions  that  determine  the  order  intensities.  Therefore,  in  the  this  new  algorithm,  it  is 
necessary  to  examine  and  modify  only  cells  along  phase  transition  boundaries.  The  result  is  a  significant  reduction  in 
processing  since  only  a  fraction  of  the  cells  are  tested  during  each  iteration.  An  additional  feature  is  the  ability  to 
increase  the  spatial  resolution  of  die  phase  array  during  the  optimization  process.  When  the  merit  function  value 
stagnates,  each  phase  cell  is  subdivided  into  4  parts  with  each  new  cell  given  the  original  cell’s  phase  value. 

Scaler  diffraction  theory  is  used  to  calculate  the  complex  amplitude  of  the  nth  order  in  the  mdi  row  as  given  by  the 
sum  of  the  contributions  from  each  of  the  JxK  phase  cells, 

J- 1  K- 1 

Am.»  =  Bm,JBn,K  X  X  Fm,j.J  Fn,k,KexP(iBj.J> 
j=0  k=0 

where,  is  the  phase  delay  associated  with  cell  (jjc),  and, 

Bm.J  -  ■ sin  (2*7) ' exp  »  31,(1  =  exP  (-2**y )  • 

The  intensity  of  the  order  is  given  by  the  complex  square  of  Am  n. 

The  quality  of  this  solution  is  measured  using  a  merit  function  that  is  a  combination  of  the  diffraction  efficiency  and 
the  deviation  of  the  older  intensities  from  their  target  values.  Uniform  intensity  arrays  as  well  as  those  containing 
suppressed  orders  and  arbitrary  orders  can  be  designed  by  selecting  the  proper  form  for  the  merit  function.  The 
parametrization  of  Has  function  has  been  observed  to  significantly  influence  the  design  time. 

First,  a  two  dimensional  data  array  is  defined  that  contains  the  JxK  pixel  phase  values  of  one  period  of  the  hologram. 
This  array  is  filled  with  a  random  distribution  of  multilevel  or  binary  phase  values.  We  have  also  included  the  ability 
to  load  the  phase  array  with  values  generated  by  a  separate  application  based  on  the  Gerchbeig-Saxton  [CS] 
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Wnt,  «*»  —p«n*i  n#  — «*»  JainMd  rmtmr ««  rilmlimrl  The  algnathm  Hiwi  eUMBM  eelh  with  the  objective  of 
iaapeovan  the  merit  facto  vitae.  A  aeries  of  four  comparisons  an  performed  where  aB  phase  values,  e^,  an 
coeapamd  against  4^  ut,  ®,4+l,  •,  _1Jk.  and  then  respectively,  tf  the  two  phase  values  are  equal,  no  action  is 

taken.  If  the  phase  values  dnfcr.  the  neighbor's  vatae  is  used  and  its  contribution  to  the  merit  fonctioa  is  calculated.  If 
the  merit  vitae  Is  improved,  the  new  phase  vatae  is  inserted  Into  tyj. 

Hie  algorithm  repeats  the  oeB  examination  and  update  sequence  until  either  a  termination  condition  is  met  or  the 
optimization  n>  proceed  with  the  optimization,  the  phase  array  is  subdivided  such  that  each  ceil  ix  replaced 

by  four  equivalent  cells.  The  search  process  is  then  revisited.  When  the  program  terminates  the  phase  army  is  stored 
in  a  data  file  which  is  laser  used  to  crease  die  lithographic  marit(s)  that  transfers  the  pattern  into  an  optical  surface. 

Uniform  intensity,  binary  phase  array  generators  prodndng  an  8x8  ipot  array  were  designed  in  about  12  seconds  on  a 
SUN  Sparcstalion  10.  An  equivalent  16  phase  level  8x8  generator  required  shoot  18  seconds.  A  32x32  ipot  array 
binary  phase  design  with  a  final  sixe  of  312x512  cells  per  quadrant  toot  42  minutes  to  generate.  When  the  Gercfabeig- 
Saxton  application  was  used  to  crease  an  initial  phase  distribation,  the  design  time  for  the  total  optummion  was 
reduced  to  16  minuses.  Typically,  binary  phase  aotarions  had  a  diffraction  efficiency  of  shoot  75%.  Figure  1  riwws  the 
hologram  designed  oaed  to  generate,  an  irregnlsriy  spaced  spot  array  used  in  a  fee-speoe  photonic  switching  system. 
A  picture  of  the  rasoiting  spot  array  from  a  fabricated  grating  is  shown  in  figure  2. 

in  summary,  this  new  algorithm,  in  conjunction  with  the  application  of  design  symmetries,  substantially  reduces 
proocaaiug  time  making  the  design  of  moderate  aiac  stray  generator!  feasible.  The  algorithm  accompliahrs  this  goal 
by  selectively  examining  only  cells  located  adjacent  to  a  phase  transition.  In  addition,  the  spatial  resolution  is 
increased  —  — «t«wl  «n«—i«g  Itm  Hwrign  »n  hajin  a  mm*  rimpl»riwti«l  Hiaiiludiiiii 
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Multilevel  diffractive  phase  elements  with  trapezoidal 
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Summary 

Digital  optics  and  material  processing  encourages  research  into  the  design  of  diffractive  phase 
dements  (DPEs)  that  split  an  incoming  tight  beam  into  an  array  of  prescribed  intensity  tight 
spots.  To  achieve  this  several  design  concepts  using  different  modulation  schemes  have  been 
proposed.  Of  these  die  optimisation  technique  of  simulated  annealing  (SA),  allied  with  a 
trapezoidal  shaped  surface-relief  phase  profile,  realised  some  of  the  highest  diffraction 
efficiencies  seen  for  binary  DPEs  to  date1.  Binary  dements  were  initially  designed  as  the 
process  for  fabricating  multilevel  phase  profiles  (Le.  microlithography)  introduced  an 
unacceptable  non-uniformity  in  the  desired  signal  wave2  despite  the  increase  in  the  diffraction 
efficiency.  Presently  electron-beam  lithography’s  direct-write  capability  can  be  used  to 
overcome  the  alignment  problem  intrinsic  to  microti thography.  Thus  fabrication  of  a  multilevel 
trapezoidal  shaped  phase  structure  with  acceptable  fabrication  error  becomes  a  possibility. 

The  design  of  a  multilevel  trapezoidal  shaped  phase  structure  with  optimised  diffraction 
efficiency  is  more  difficult  than  for  the  binary  case.  To  overcome  this  we  offer  a  design 
concept  that  is  embedded  in  tire  theoretical  framework  of  design  theory  of  diffractive 
dements3. 

As  only  the  intensity  of  die  desired  diffraction  pattern,  which  we  call  the  desired  signal 
wave,  is  specified  within  the  signal  windows  W,  die  phase  is  a  free  parameter.  The  first  design 
step  is  an  optimisation  of  the  signal  phase  with  respect  to  the  upper  bound  of  die  diffraction 
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efficiency,  as  a  function  of  the  number  of  phase  levels  allowed  for  the  element.  This  step 
furnishes  information  about  the  maximum  diffraction  efficiency  achievable  for  the  specified 
number  of  phase  levels.  Thus,  if  a  specific  diffraction  efficiency  is  required  for  an  application, 
this  step  gives  information  about  the  minimum  number  of  phase  levels  the  design  has  to  have 
in  order  to  achieve  the  desired  efficiency.  To  optimise  the  diffraction  efficiency  upper  bound, 
an  iterative  Fourier  Transform  algorithm  is  employed. 

The  projection  of  the  spectrum  of  the  phase  optimised  signal  wave  onto  die  allowed 
phase  levels  provides  a  set  of  phases  for  a  sampled  quantised  phase  hologram.  As  this  is  a 
sampled  data  set,  a  parameterisation  of  the  projection  is  needed.  We  use  a  parameterisation 
that  leads  to  trapezoidal  shaped  pulses.  This  step  yields  an  set  of  initial  parameters  that  are 
already  optimised  with  respect  to  diffraction  efficiency.  A  non-linear  optimisation  of  the  initial 
set  of  parameter  is  then  performed  in  order  to  increase  the  uniformity,  which  worsens  as  a 
result  of  parameterisation. 

This  design  concept  contributes  detailed  instructions  on  how  to  design  a  phase  grating 
with  optimised  diffraction  efficiency  as  a  function  of  a  specified  number  of  phase  levels. 
Assuming  that  a  specified  minimum  diffraction  efficiency  is  required  for  an  application,  the 
first  step  in  the  design  concept  gives  information  about  the  lowest  number  of  phase  levels 
required  to  achieve  this.  The  following  steps  result  in  a  initial  distribution  for  a  non-linear 
optimisation  that  is  near  the  desired  solution. 
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Abstract 

2D  fully  interconnected  processing  systems  where  the  output  array  is  smaller  than  the  input 
array  can  use  a  fan-out  interconnect  where  the  extra  channels  are  accomodated  within  the 
repeat  spacing  of  the  input  array. 

Introduction 

A  commonly  used  2D  fan-out  (fan-out  A),  where  the  input  array  is  replicated  at  neighboring 
spatial  locations  which  are  located  at  distances  of  at  least  the  array  size  from  the  original 
array,  is  illustrated  in  Fig.  1. 
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Fig.l  Conventional  2D  array  fan-out. 

A  complementary  fan-out  is  illustrated  in  Fig.  2  (fan-out  B).  It  is  complementary  in  the  sense 
that,  whereas  A  is  useful  when  the  subsequent  fan-in  is  to  an  array  of  larger  size  repeat 
spacing  than  the  input,  B  is  useful  when  the  output  array  is  of  smaller  repeat  spacing.  This  can 
be  useful  in  cascaded  interconnect  systems,  for  example. 
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Fig.2  Interfaced  2D  array  fan-out. 


A  further  advantage  of  B  is  that  it  allows  lower  resolution  fan-out  gratings  to  be  used  than  A. 
This  in  turn  eases  the  fabrication  tolerances  of  the  grating.  The  reduced  fan-out  angle  also 
allows  more  compact  systems  to  be  envisaged. 

Demonstration  of  interlaced  fan-out 

The  layout  of  the  optical  system  is  shown  in  Fig.  3.  Collimated  light  of  6  mm  diameter  from  a 
HeNe  laser  beam  is  incident  on  a  8  x  1  Dammann  grating  of  690  pm  period,  which  was 
fabricated  at  the  Paul  Scherrer  Institute,  Ziirich.  A  lenslet  array  of  45  mm  focal  length  and 
1 . 15  x  1 .4  mm  repeat  spacing  acts  as  an  array  of  Fourier  Transform  lenses  to  form  multiple 
fan-outs  in  the  focal  plane  of  the  lenslets.  Provided  that  the  grating  is  not  placed  at  the  focal 
plane  of  the  lenslet  array,  the  multiple  fan-outs  can  be  fanned-in  to  an  8  x  1  array  using  a 
simple  lens,  in  this  case  of  focal  length  80  mm.  The  spot  separation  at  the  output  is 
determined  by  the  distance  between  the  grating  and  lenslets.  In  this  case,  it  was  about  one  half 
the  focal  length  of  the  lenslets.  Figure  3  shows  the  beam  paths  for  one  of  the  diffracted  beams, 
which  produces  a  single  point  at  the  output. 
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Fig.  3  Component  layout  of  interlaced  fan-out. 


Results  and  discussion 

The  focal  plane  of  the  lenslets  contains  spatially  multiplexed  8x1  arrays  on  the  repeat 
spacing  of  1.15  x  1.4  mm  (Fig.  4a).  The  spot  size  within  the  array  is  variable  between  one  row 
and  another  of  the  lenslet  array.  However,  some  rows  produce  the  calculated  spot  size,  which 
is  about  one  half  of  the  repeat  spacing  within  the  area,  since  there  are  approximately  two 
grating  periods  per  lenslet.  In  the  focal  plane  of  the  lens  there  is  just  a  single  8x1  array  with  a 
140  pm  repeat  spacing  and  20  pm  spot  size  (Fig.  4b). 


Fig.  4.  Focal  plane  of  lenslets  (a),  and  rear  focal  plane  of  lens  (b). 


This  type  of  fan-out  can  form  the  basis  of  a  compact  interconnect  structure  because  the  input 

channels  are  well  separated  at  the  start  and  the  demands  on  the  optical  components  are  not 

high.  In  the  demonstrated  arrangement,  one  can  envisage  placing  a  liquid  crystal  television  < 

screen  in  the  focal  plane  of  the  lenslets  where  the  repeat  spacing  of  the  spots  in  each  array  is 

80  pm.  Smaller  focal  length  lenslets  with  a  smaller  period  grating  would  give  a  sharper  spot 

size  in  this  plane.  Shorter  focal  length  (16  mm)  microlens  arrays  have  already  been  used  for 

multiple  Fourier  transforms  [1].  Since  both  die  lenslets  and  lens  work  at  small  field  angles,  no 

special  lens  design  features  are  required.  In  order  to  apply  this  interconnect  in  a  real  system, 

better  quality  lenslets  are  required.  < 
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Considerations  for  free  space  microoptical  systems  in  layer  architecture 
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Abstact:  Architectural  concepts  for  micro-integrated  digital  optical  data  processing  systems 
are  presented.  From  these  concepts  a  microoptical  system  consisting  of  stacked  layers  of  micro¬ 
components  is  derived. 

Introduction:  Microoptical  systems  for  digital  optical  information  processing  impose  a  series  of 
requirements  on  the  optical  realization.  Light  efficiency  is  necessary  to  achieve  low  bit-error  rates 
with  the  available  optical  power.  For  constructing  complex  systems  from  standardized  parts, 
the  system  also  has  to  be  designed  in  a  modular  way.  As  interfaces  between  these  modules  and 
for  space  variant  operations,  real  images  of  the  data  planes  must  exist,  where  e.g.  masks  can  be 
placed.  For  space  invariant  operations  on  data  planes,  an  optically  generated  Fourier  plane  is 
necessary,  where  beam  forming  and  beam  deflecting  components  can  be  inserted. 

Fourier  stages:  Single  lens  2/  Fourier  stages  (fig.  1)  exhibit  certain  disadvantages  when  used  in 
microoptical  setups  consisting  of  arrays  of  identical  components.  A  lens  diameter  of  two  times  the 
pupil  diameter  is  necessary  to  avoid  vignetting.  In  an  integrated  system,  the  2 /  configuration 
requires  an  additional  spacer  in  order  to  place  the  lens  in  the  center.  From  the  fabrication 
point  of  view  it  would  be  more  desirable  to  place  the  lenses  at  the  substrate’s  surface.  A  more 
general  two-lens  configuration  is  shown  in  fig.  2,  which  is  identical  to  the  2/  system  when  b 
becomes  zero.  This  setup  is  chosen  symmetrically,  because  the  space-bandwidth  product  reaches 
a  maximum,  as  was  shown  in  [1].  Assuming  symmetric  configurations,  the  number  of  resolvable 
pixels  is  maximal  when  6  =  /  and  a  =  0  (2].  This  represents  the  ’light  pipe’  configuration.  In 
practical  implementations,  a  small  distance  a  is  often  required  for  technological  reasons.  With 
microoptical  systems,  space-bandwidth  is  a  critical  parameter.  The  distance  6  should  therefore 
be  chosen  close  to  /  to  obtain  maximal  resolution. 


Figure  1:  2/  system  Figure  2:  Two-tens-ftrarter'setup 


Pupil  division:  Digital  optical  systems  often  require  the  generation  of  multiple  copies  of  a 
data-  plane  and  the  superposition  of  different  dataplanes.  For  these  split  and  join  operations 
pupil  division  is  most  advantageous  since  the  join  operation  can  be  performed  almost  lossless  for 
an  arbitrary  number  of  joins.  Pupil  division  may  be  performed  either  by  splitting  the  aperture 
of  a  single  lens  or  by  using  a  lens  for  each  channel  in  the  pupil  plane.  In  microoptical  setups, 
multi-lens  pupil  division  is  preferable,  because  the  full  space-bandwidth  product  of  the  light 
pipe  is  available  in  all  channels.  Additionally,  multi-lens  pupil  division  can  also  be  used  for 
space  invariant  nearest  neighbor  interconnections. 
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Layer  architect  are:  A  transfer  of  these  consideration*  to  a  micro-integrated  system  is  shown 
in  fig.  3.  This  layer  architecture  consists  of  two  modified  lightpipes  with  multi-lens  pupil  division. 
Precise  alignment  can  be  achieved  if  each  layer  is  produced  with  lithographic  precision.  The 
light  pipes  can  be  realized  on  one  substrate  with  microlens  arrays  on  both  sides.  Mask  layers 
are  introduced  on  additional  spacers.  The  prism  layers,  realized  by  embossing,  perform  space 
invariant  split  and  join  operations.  Since  for  thermal  and  practical  reasons,  active  devices  should 
be  located  at  the  system  surfaces.  In  most  cases  active  devices  on  one  surface  are  sufficient. 
Cascading  of  several  stages  is  then  realized  by  placing  a  mirror  layer  on  the  top  system  surface, 
as  shown  in  fig.  4.  The  performance  and  typical  dimensions  of  these  layer  architectures  will  be 
discussed  in  the  presentation. 
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Figure  3:  Microsystem  in  layer  architecture  Figure  4:  Cascading  with  active  devices 
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Abstract 

An  active  alignment  system  is  demonstrated.  The  error  between  the  actual  and  desired  spot 
location  on  a  quadrant  detector  is  used  to  compute  the  angular  displacement  required  for  two 
Risley  Prisms  to  centre  the  spot 

Summary 

Free-space  optical  interconnects  hold  the  promise  of  alleviating  the  communication 
bottlenecks  which  will  arise  in  future  connection-intensive  electronic  systems.  A  separable 
interconnect  offering  the  massive  connectivity  of  free-space  optics  will  greatly  increase  die 
data  throughput  between  multi-chip-modules  and/or  printed  circuit  boards  on  an  optical 
backplane. 

A  key  issue  facing  free-space  photonic  circuit  design  is  that  of  alignment.  For  a  system  to  be 
of  practical  use,  it  must  be  capable  of  operating  continuously  over  long  periods  of  time, 
unaffected  by  external  vibrations  and  by  temperature  changes. 

At  least  two  approaches  exist  to  overcome  alignment  drift: 

1)  Design  an  extremely  rigid  system  which  will  not  drift  over  time.  This  can  be  performed 
by  removing  as  many  degrees  of  freedom  as  possible!!]  and  by  improving  pre-alignment. 

2)  Implement  active  alignment,  a  process  in  which  system  parameters  such  as  throughput 
or  error  in  spot  position  are  monitored  and  fed  back  to  a  controller  which  realigns  the 
system  by  altering  the  state  of  the  optics.  Such  simple  feedback  loops  exist  in  CD 

players!^]. 

An  example  of  active  alignment  we  are  currently  investigating  involves  a  simple  closed 
feedback  loop  which  centres  a  spot  on  a  quadrant  detector.  The  control  loop  has  the  following 
characteristics: 

Input.  Light  intensity  incident  on  a  quadrant  detector  generates  differential  photo-currents 
from  which  die  spot  position  can  be  determined. 

Output:  The  controller  outputs  the  desired  angle  of  rotation  to  two  stepper  motors  which 
rotate  two  Risley  Steer ers  located  at  the  focal  plane  of  a  lots.  The  rotational  displacement 
of  the  steer  ers  imparts  an  angular  displacement  to  die  beam  which  creates  a  lateral 
displacement  of  the  spot  on  the  quadrant  detector. 

Controller:  An  error  signal,  which  is  die  difference  between  die  ideal  spot  position  and  die 
detected  spot  position,  is  fed  back  into  the  controller  algorithm.  Given  the  error  signal  and 
the  current  rotational  position  of  the  steerers,  the  algorithm  computes  the  new  rotational 
position  of  both  Risley  Steerers  which  will  centre  die  spot  on  the  quadrant  detector. 

Embedded  in  the  control  algorithm  is  a  transform  (CtoR  transform),  which  maps  a  Cartesian 
(x,y)  displacement  to  a  Rotational  displacement  of  two  steerers  (6a,  6b)  A  CtoR  transform 
which  is  based  on  first  order  optics  can  be  expressed  as  follows: 

6A  =  ei  +0a 
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where 

01  »  Arctan  (y  lx)  (modified  to  give  an  angle  between  0  and  2k). 

0a  =  Arccos[Sqrt  (x^+y2)  /  (2r)J 

2r  =  2f  tan  (n  -  1)  8,  the  maximum  displacement  imparted  by  Risley  steer ers  of  wedge 

angle  8,  index  n,  and  located  at  the  focus  of  a  lens  with  focal  length  f. 

0A  and  0b  are  conventional  geometric  angles,  they  are  zero  on  the  positive  x  axis  and 
increase  in  a  counterclockwise  direction.  For  example,  point  (x,  y)  =  (2r,  0)  will  map  to  (0A, 
0B)  -  (0,0)  and  point  (x,  y)  =  (0,  2r)  will  map  to  (0A.  ©B)  =  (k/2,  k/2).  Given  the  physical 
nature  of  the  cities,  there  are  two  different  (0a,  0b)  values  for  any  given  (x,  y)  (x  *  0  and  y 
#  0).  The  above  transform  is  thus  only  one  of  a  family  which  will  perform  the  desired 
mapping.  Finally,  it  should  be  noted  that  from  a  control  perspective,  this  system  is  nonlinear 
since  multiplying  the  input  error  signals  by  a  scalar  k  will  not  correspondingly  multiply  the 
control  outputs  by  k. 

The  experimental  demonstration  features  two  40  mm  lens  and  two  Risley  Steerers  each  with 
a  IS  minute  wedge  and  n  =  1.509.  A  simple  beamsplitter  and  viewing  system  are  used  to 
directly  view  the  spot  on  the  detector.  The  maximum  possible  spot  displacement  for  our 
system  is  of  the  order  of  2r  =  177  pm  The  incremental  rotation  of  the  Risley  Steerers  required 
to  move  a  spot  by  Ax  =  1  pm  depends  on  the  spot  position,  but  is  usually  of  the  order  of 
0.65°.  The  setup  is  shown  in  Figure  1. 

The  controller  was  implemented  on  a  Macintosh  computer.  Inputs  are  obtained  in  two 
different  ways:  1)  by  focusing  the  spot  on  a  CCD  camera,  using  a  frame  grabber  to  define  a 
virtual  quadrant  detector  on  the  digitized  CCD  image  and  extracting  the  spot  centre  from  the 
digitized  CCD  image  2)  by  focusing  the  spot  on  an  actual  silicon  photodetector  and  measuring 
the  resulting  photocurrents.  Outputs  to  the  stepper  motors  are  sent  out  on  serial  electrical 
lines  from  the  computer. 
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Abstract 

The  optical  and  optomechanical  design  of  a  representative  portion  of  a  lenslet  array  based  free- 
space  photonic  backplane  is  described.  Issues  relating  to  the  optical  performance,  optomechanical 
layout  and  alignment  tolerances  will  be  discussed. 

Summary 

The  following  paper  will  consider  the  optical  and  optomechanical  design  of  a  representative 
portion  of  an  experimental  free -space  photonic  backplane.  The  system  uses  FET-SEED  modulators 
and  receivers  to  transmit  the  data  and  multi-level  diffractive  lenslets  to  interconnect  the  two 
device  planes.  This  work  forms  part  of  a  project  aimed  at  developing  terabit  capacity  photonic 
backpiane  technology. 

Integrating  free-space  optics  directly  into  a  backplane  is  a  challenging  problem  for  a  number  of 
reasons.  Current  electronic  backplanes  have  a  board-to-board  spacing  of  only  about  one  inch  (the 
card  pitch  for  Futu reboard  +  modules,  for  example,  has  been  set  at  30mm  [1]).  Any  competitive 
optical  backplane  must  therefore  be  compact  enough  to  fit  within  this  space.  In  addition,  the 
system  must  be  arranged  such  that  it  is  possible  to  remove  and  insert  circuit  boards  while 
maintaining  a  high  degree  of  alignment  between  the  optical  signal  beams  and  the  transmitter  and 
receiver  arrays.  This  places  exacting  demands  on  the  optomechanical  design  of  the  backplane. 

The  optical  design  of  the  backplane  demonstrator  is  shown  in  Fig.  1.  Linear  polarized  light  from  a 
Ti-Sapphire  laser  operating  at  850nm  enters  the  system  via  a  single  mode  polarization  preserving 
fiber.  A  periodic  binary-phase  grating  (BPG)  is  used  to  generate  an  array  of  optical  beams  which 
are  collimated  by  the  first  lenslet  array  and  reflected  by  the  polarizing  beam  splitter/quarter 
waveplate  assembly  onto  the  modulator  array.  The  BPG  utilizes  a  non-separable  grating  design 
and  has  a  theoretical  diffraction  efficiency  of  773%.  The  modulated  signal  beams  pass  through 
the  PBS  (the  polarization  of  the  light  having  being  rotated  by  the  quarter  waveplates)  and  onto 
the  receiver  array.  As  the  system  uses  dual-rail  logic,  the  signal  channels  were  designed  to  handle 
two  signal  beams,  one  for  each  FET-SEED  window  (the  FET-SEED  windows  have  dimensions  of 
approximately  25|unx25pm).  A  4-f  optical  layout  was  therefore  chosen  to  ensure  a  high  optical 
throughput  and  improve  the  tolerance  of  the  system  to  misalignment  errors.  Implementing  a 
photonic  backplane  using  lenslet  arrays  has  several  advantages;  it  can  be  made  compact  enough  to 
fit  within  the  limited  space  available,  it  does  not  suffer  from  field  dependent  aberrations  and  it 
will  not  be  affected  by  the  angular  dependence  of  polarizing  beam-splitters  [2}.  The  photonic 
backplane  demonstrator  which  we  shall  describe  has  a  channel  spacing  of  600  microns,  a  device 
plane  separation  of  just  over  30mm  and  was  designed  to  initially  interconnect  a  2x2  array  of  optical 
channels.  The  alignment  tolerances  required  by  this  system  include  a  focal  length  accuracy  for  f\  of 
±0.25%,  a  lenslet  array  positional  accuracy  of  t20pm,  a  defocus  error  of  ±50pm,  a  device  plane  tilt 
of  ±0.5°,  and  a  device  plane  roll  (with  respect  to  die  lenslet  arrays)  of  better  than  0.25°.  A  full 
discussion  of  the  alignment  tolerances  will  be  given. 

The  mechanical  layout  of  the  backplane  is  shown  in  Fig.  2.  The  optical  components  are  mounted  in 
steel  cylinders,  prealigned  and  positioned  using  a  magnetic  slot  channel  arrangement  [3].  The 
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baseplate  was  fabricated  from  Magnesium  AZ31B  tooling  plate  due  to  the  material's  high  flatness 
tolerance,  lightness  and  the  ease  with  which  it  can  be  machined.  The  magnets  are  mounted  on  top 
of  steel  bars  which  have  the  effect  of  increasing  the  restraining  force  on  the  components.  A 
chamfered  groove  structure  was  chosen  to  reduce  wear  and  tear  and  improve  alignment  tolerances. 
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Fig  1.  Optical  design  of  system. 


Fig  2.  Optomechanical  layout  of  backplane. 
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Abstract 

The  design,  manufacture  and  assessment  of  a  hybrid  lens  far  an  optical  computing  demonstrator  will 
be  discussed.  A  comparison  will  be  made  of  the  performance  of  tfae  microlens  singlets  and  doublets  used. 

Summary 

Microlens  arrays  have  been  proposed  as  components  that  will  enhance  the  performance  of  conventional  optics 
through  the  exploitation  of  their  small  diameter  and  focal  length.  Such  features  lead  to  compact  optics  that  can 
be  readily  integrated  with  semiconductor  devices.  Hybrid  lenses  (figure  1)  have  been  proposed  [1].  These 
devices  exploit  the  lxge  field  of  view  of  slow,  conventional  lenses  with  tfae  small  spot  size  that  can  be  achieved 
by  microknse8.  An  f/S  lens  was  designed  which  images  over  a  ±5°  field  angle  and  produces  spots  with  99%  of 
the  energy  coupled  into  a  13pm  spot  This  spot  is  imaged  by  an  afocal  telescope  composed  of  1/3  and  1/1 
microlens  doublets.  The  doublets  work  at  infinite  conjugates.  The  first  doublet  collimates  the  light  and  the 
second,  faster  microlens  doublet  focuses  the  light  into  a  small  spot  The  hybrid  leas  creates  an  array  of  small 
spots  over  a  large  field  (7mm).  These  hybrid  lenses  will  be  used  to  interconnect  two  S-SEED  arrays.  S-SEEDs 
comprise  two  3pm  windows  which  are  separated  by  20pm.  Imaging  two  spots  with  each  microlens  requires 
that  they  be  used  off-axis. 

Code  V  has  been  used  to  model  die  performance  of  the  hybrid  lens.  The  performance  is  evaluated  by  the 
calculation  of  the  encircled  energy.  The  spot  size  is  dependent  on  the  speed  of  the  final  microlens  doublet  and 
the  amount  of  spherical  aberration  introduced  by  the  afocal  pair.  The  diameter  of  the  collimated  beam  is 
determined  by  the  f/#  of  the  conventional  lens  and  the  focal  length  of  the  first  microlens  doublet  The 
aberrations  that  the  second  microkns  doublet  introduce  are  determined  by  the  size  of  this  collimated  beam. 

The  microlenses  doublets  have  been  produced  in-house  in  our  micro-fabrication  facility.  The  microlenses  are 
made  from  melted  photoresist  on  100pm  thick  substrates.  With  this  technique  we  can  fabricate  microlens 
arrays  with  f-numbers  ranging  from  1/2  to  f/B  and  individual  lens  diameters  ranging  from  30  to  230pm.  Thin 
substrates  are  required  as  the  afocal  relay  has  a  working  distance  of  130pm  and  tbe  focus  must  be  outside  of  the 
glass  to  permit  the  imaging  of  S-SEEDs.  The  microleas  array  doublets  are  assembled  using  our  mask  aligner 
which  is  equipped  with  a  stereo  microscope.  A  spacer  layer  (-20- 30pm  thick)  is  placed  between  the  two 
microleas  arrays,  and  the  doublets  are  secured  in  place  using  UV  curing  optical  cement  Our  measurements 
have  shown  that  an  alignment  accuracy  of  better  than  1pm  is  routinely  achievable. 

Interferometric  analysis  has  been  performed  on  the  microleas  doublets  and  afocal  pairs.  A  Mach-Zehnder 
interferometer  with  fringe  analysis  software  is  used  to  analyse  the  aberrations  introduced  by  the  microlenses. 
The  off-axis  performance  is  evaluated  by  changing  the  angle  of  the  beam  at  tbe  aperture  stop  of  the  bulk  leas. 
This  shifts  the  focused  spot  across  the  microlens  enabling  off-axis  measurements  to  be  made.  Measurements 
will  be  presented  of  the  performance  of  hybrid  lenses  including  the  achieved  spot  sizes  and  interferograms 
drawing  the  aberratioos  introduced  into  the  beam  by  the  hybrid  lens  (figure  2). 

[1]  McCormick  F.B.,  "Free-Spnce  Interconnection  Techniques,*  Chapter  13  in  "Photonics  in  Switching,*  ed.  by 
JJEL  Midwinter,  Plenum  (1993). 
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Fifure  1:  Ptin  of  doublets  relaying  and  magnifying  off-axis  input  spot 
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Abstract 

By  use  of  a  laser  writing  system,  an  array  of  four  off-axis  microlenses  for  perfect 
shuffle  interconnections  and  the  toric  microlenses  for  planar  optical  imaging  have 
been  fabricated. 


I.  Introduction 

In  the  fields  of  communication  and  optoelectronic  processing,  microoptical  interconnections 
make  a  potential  to  improve  the  degree  of  system  integration  and  stability.  Microoptical 
components  for  focusing,  imaging,  branching,  combining,  and  so  on,  have  progressed  beyond 
prototypes  by  the  fabrication  techniques  of  PR  melting,  ion  exchange,  deposition,  e-beam 
writing,  and  laser  writing.  The  writing  technique  using  e-beam  or  laser  beam  makes  it 
possible  to  produce  an  arbitrary  surface  profile,  such  as  the  continuous  surface-relief  grating 
for  spot  array  generators  or  a  set  of  diffractive  and/or  refractive  components  on  a  same 
substrate[l-2].  In  this  work,  the  laser  writing  technique  is  described  to  make  an  array  of  four 
microlenses  specially  designed  for  perfect  shuffle  interconnections,  as  well  as  toric  microlenses 
for  planar  optical  imaging. 

II.  Laser  writing  system 

Figure  1  shows  the  laser  writing  system  which  consists  of  air-cooled  A r+  laser  (X=458nm, 
4mW),  acousto-optic  modulator,  XY  positioning  stage  with  a  moving  resolution  of  0.1pm,  and 
CCD  camera  for  monitoring  the  focusing  of  the  incident  beam  on  the  sample  substrate.  The 
sample  substrate  of  a  10  pm  thick  positive-PR  (AZ4562)  layer  coated  onto  glass  substrate 
was 


Figure  1.  Scheme  of  laser  writing  system 

exposed  by  the  raster  scanning  process,  and  developed  with  die  solution  of  AZ500MIF.  Based 
on  die  gamma  curve  of  the  PR  layer,  any  surface-relief  micro-structure  can  be  fabricated  as  it 
was  designed.  The  spot  size  of  die  incident  beam  focused  on  the  PR  layer  was  measured  by 
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2.3tun.  Therefore  this  system  would  be  more  useful  for  rt  active  microoptics  with  larger 
dimensions  than  for  diffractive  ones. 

III.  Results 

We  have  fabricated  a  set  of  four  off-axis  microlenses  for  the  application  of  perfect  shuffle 
interconnections.  Figure  2(a)  shows  the  interferogram  of  the  microlenses  obtained  by  the 
Mach-Zehnder  interferometer.  The  dimension  of  one  microlens  was  (d= 100pm and  the  axis 
was  shifted  toward  the  center  of  four  lenses  with  the  amount  of  d/4  =  25pm.  This  configuration 
enables  to  implement  die  perfect  shuffling  of  the  2-D  arrays  of  input  signals  at  four  quadrants. 
Figure  2(b)  shows  the  four  spots  with  the  d/4  spacing  generated  by  the  four  off-axis 
microlenses,  and  one  of  the  wavefronts  after  passing  through  the  microlenses  is  depicted  in 
Fig.2(c).  There  are  some  of  microlenses  with  a  spherical  shape  of  Fig.3(a)  shown  as  an 
interferogram  and  an  aspherical  (toric)  shape  of  Fig.3(b).  Further  detail  characteristics 
including  optical  aberrations  of  the  microlenses  in  Fig.2  and  Fig.3,  will  be  discussed  in  the 
presentation. 

References 

1.  M-  T.  Gale,  M.  Rossi,  H.  Schutz,  P.  Ehbets,  H.  P.  Herzig,  and  D.  Prongue,  Appl. 

Opt.,  32,  2526(1993). 

2.  T.  Shiono  and  H.  Ogawa,  Opt.  Lett.,  17,  565(1992). 


Figure  2.  Array  of  four  off-axis  microlenses  for  perfect  shuffling,  (a);  interferogram, 
(b);  focused  spots,  and  (c);  wavefront  after  passing  through  one  of  the  microlenses. 


Figure  3.  Interferograms  of  the  fabricated  microlenses  with  a  spherical  shape  in  (a) 
and  an  aspherical  (toric)  shape  for  imaging  an  input  with  oblique  incident 
angle  of  30  deg.  info). 
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ABSTRACT 

We  describe  the  fabrication  of  self  aligned  arrays  of  microlenses 
and  fiber  holders  on  the  same  PMMA  substrate  using  a  suitable 
deep  proton  irradiation. 


SUMMARY 

For  almost  all  fiber  applications  the  combination  of  fibers  and 
microlenses  is  of  evident  relevance.  A  microlens  can  couple  light 
from  a  source  into  a  fiber  and  collimate  light  coming  out  of  a 
fiber.  In  both  cases  the  longitudinal  and  the  lateral 
displacement  between  fiber  and  lens  have  to  meet  strict 
tolerances.  Also  of  interest  is  the  alignment  between  arrays  of 
fibers  and  arrays  of  microlenses,  which  is  of  particular 
relevance  to  optical  interconnects  and  optical  computing. 
Problems  arise  from  the  fact  that  conventional  methods  of 
fabricating  fiber  holder  and  microlens  arrays  are  fairly 
distinct.  This  is  not  the  case  for  a  technology  called  deep 
proton  irradiation1*2.  With  this  technique  it  is  possible  to 
fabricate  lenses  on  the  top  side  of  a  polymethyl  methacrylate 
(PMMA)  layer  and  fiber  holder  holes  on  the  back  side.  The  PMMA 
layer  can  have  a  thickness  of  several  hundred  pm  up  to  more  than 
1  mm.  This  results  in  high  holder  aspect  ratios  for  a  standard 
fiber  (125  pm)  having  for  example  a  500  pm  deep  holder  hole.  A 
simulation  of  the  dose  deposition  in  a  500  pm  thick  PMMA  layer 
irradiated  with  2*1012  protons/cm2  of  7  MeV  is  shown  in  fig.  1. 
The  figure  shows  the  lines  of  equal  dose  deposition  in  a  side 
view  to  the  irradiated  PMMA  substrate.  As  can  be  seen  from  this 
figure  in  the  shadow  region  (left  hand  side)  the  dose 
distribution  shows  fan  shaped  lines  caused  by  the  straggling  of 
the  incident  protons  after  multiple  collisions.  This  can  be  used 
to  achieve  fan  shaped  fiber  holders  which  facilitates  the  fiber 
positioning  and  tightening.  The  alignment  between  a  microlens 
array  and  a  fiber  holder  array  can  be  obtained  by  irradiation  of 
two  PMMA  layers  through  the  same  mask,  one  layer  for  a  microlens 
array  the  other  for  the  fiber  holder  array.  After  irradiation  the 
two  layers  are  handled  separately  in  different  processes  and 
then  recombined.  More  sophisticated  is  the  use  of  only  one  PMMA 
layer  with  only  one  exposure  and  a  separate  handling  of  the  two 
sides.  Then  the  microlenses  are  placed  directly  above  the  fiber 
holder  holes  and  are  therefore  best  aligned  to  the  hole.  An 
approach  to  a  monolithic  solution  using  the  longitudinal  dose 
distribution  (fig.  1  right  hand  side,  irradiated  region)  is  in 
progress.  Experimental  results  like  in  fig.  2b  will  be  discussed. 
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Fig.  1  Dose  distribution  in  PMMA  under  a  vertical  edge  after 
irradiation  with  a  dose  of  2*1012  protons/cm2 


Fig. 2b  SEM  picture  of  an  experimental  result, 
1st  step:  side  view  of  a  fan-shaped  hole  in 
PMMA.  Top  diameter:  250  pm,  bottom  diameter: 
270  pm. 
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Phase-matched  Fresnel  elements  have  been  fabricated  for  laser  array  to  fibre-cable 
coupling  and  for  fan-out  operations.  The  continuous-relief  planar  microoptical  elements 
are  produced  by  direct  laser  writing  in  photoresist  and  copied  by  replication  techniques. 

1.  Introduction 

Parallel  optical  links  are  attracting  increasing  interest  in  the  fields  of  data  communications  and 
optical  computing.  Typical  applications  include  lens  arrays  for  light  source  to  fibre  array  coupling  and 
fan-out  elements  for  parallel,  high-speed,  high-capacity  interconnects  for  optical  processors  and  data 
links.  The  paper  will  describe  progress  in  the  realisation  of  Phase-Matched  Fresnel  Elements  (PMFEs 
[1])  for  such  applications.  These  computer-generated  Diffractive  Optical  Elements  (DOEs)  are 
fabricated  as  surface-relief  microstructures  by  direct  laser  writing  in  photoresist  and  can  be  reproduced 
by  replication  techniques.  PMFEs  can  be  produced  with  very  high  numerical  aperture,  arbitrary 
aperture  shape  and  as  large  area,  close-packed  arrays.  Examples  of  laser  diode  to  fibre  imaging  lens 
arrays  with  high  numerical  aperture  and  of  focusing  Nxl  fan-out  elements  will  be  described. 

2.  Fabrication 

The  fabrication  of  Phase- matched  Fresnel  elements  with  continuous-relief  surface  profiles 
represents  a  challenging  area  of  modem  optical  fabrication  technology,  in  particular  for  high  aperture 
lenslets  with  segment  sizes  in  the  order  of  micrometers.  Laser  writing  (Fig.  1)  offers  a  highly  flexible 
approach  for  the  fabrication  of  such  DOEs.  Technology  for  direct  laser  writing  in  photoresist  has  been 
developed  over  a  number  of  years  at  the  Paul  Scherrer  Institute  in  Zurich.  The  system,  described  in 
more  detail  elsewhere  [2],  uses  a  HeCd  laser  to  expose  a  photoresist  coated  substrate  which  is  raster 
scanned  under  the  focused  beam  using  a  high  precision  xy-stage.  In  this  paper  we  will  present  latest 
results  for  PMFEs  fabricated  using  this  technique  and  discuss  the  limitations  in  microstructure 
resolution  and  lenslet  numerical  aperture  as  determined  by  current  technology. 

PMFE  microstructures  can  be  characterised  by  the 
maximum  relief  depth  and  the  minimum  segment 
size.  In  laser  writing  technology,  the  maximum 
depth  is  given  by  the  resist  layer  thickness,  typically 
-  5  pm.  The  minimum  segment  size  at  the 
perimeter  of  the  lenslet  area  is  determined  by  the 
numerical  aperture  (NA)  of  the  lenslet,  together  with 
the  phase  (height)  step  at  the  segment  boundary. 

The  maximum  aperture  of  Fresnel  lenslets  is 
determined  by  the  size  of  a  segment  in  which  the 
continuous-relief  profile  can  be  reasonably  attained 
using  the  writing  spot  size;  for  a  relief  depth  of 
5  pm,  this  typically  corresponds  to  a  maximum 
lenslet  aperture  of  NA  -  0.5  (phase  step  of  8ft). 

Lenslets  with  this  NA  have  a  dominantly  diffractive  Fig.  1.  Continuous-  relief  Fresnel  elements  are 
behaviour  and  function  best  with  monochromatic  fabricated  by  direct  writing  in 

(laser)  or  narrow  band  (LED)  illumination.  photoresist. 


WP23/276 


Fig.  2.  Interlaced  PMFEs  for  focusing  fan-out  functions  - 

schematic  and  AFM  image  of  a  fabricated  microstructure. 


Resist  surface-relief  microstructures  are  electroformed  to  a  Ni  shim  and  reproduced  by  replication  in 
plastic  or  epoxy  materials.  Replication  technologies  which  are  commercially  available  and  currently 
unda  investigation  include  hot  embossing,  injection  moulding  and  uv- replication  techniques. 

3.  Fresnel  lens  arrays  for  laser  to  fibre  coupling 

High  numerical  aperture,  planar  micros tructures  and  flexible  design  in  shape  and  array  formats 
are  attractive  features  for  the  application  of  PMFEs  in  the  coupling  of  laser  diode  arrays  to  fibre  ribbon 
cable.  An  example  will  be  presented  of  a  PMFE  array  designed  for  coupling  an  array  of  laser  diodes 
(X.  =  831  nm)  into  a  ribbon  of  12  fibres.  The  laser  diodes  each  emit  an  astigmatic  beam  with  divergence 
angles  9j_  =  8°  and  0//  =  28°  (FWHM)  in  the  horizontal  and  vertical  planes.  The  MT-connector 
compatible  ribbon  cable  consisted  of  12  multimode  fibres  with  0.21  NA  and  SO  pm  core  diameter.  The 
PMFE  fabricated  achieved  a  measured  efficiency  of  60%  with  a  laser  to  fibre  separation  of  1.65  mm. 

4.  Spatially  interlaced  lenses  for  fan-oat  applications 

Novel  PMFE  structures  have  been  designed  to  perform  focusing  fan-out  functions  (see  Fig.  2.)  in 
a  single  planar  microoptical  element  The  desired  fan-out  function  is  implemented  by  combining 
different  PMFEs  (e.g.  one  for  each  interconnection  channel)  in  a  special  area  sharing  arrangement 
Each  PMFE  is  divided  into  a  subarray  structure,  leading  to  an  array  of  focused  diffraction  orders 
centred  around  the  focal  point  of  the  basic  PMFE.  The  period  of  the  different  subarray  structures  is 
chosen  such  that  the  diffraction  orders  coincide  with  and  are  coherently  superimposed  upon  the  desired 
image  points.  Simple  and  fast  procedures  have  been  developed  for  optimising  such  fan-out  PMFEs, 
based  upon  a  low  number  of  well-defined  physical-optical  parameters.  The  concept  can  easily  be 
extended  for  realising  2D  fan-out  elements. 

Such  fan-out  elements  have  been  fabricated  by  direct  laser  beam  writing  in  photoresist  The  splitting 
and  spatial  interlacing  procedures  are  performed  in  real-time  during  the  writing  process.  A  major 
advantage  of  the  PMFE  fan-out  approach  is  tire  large  tolerance  with  respect  to  fabrication  errors.  In  a 
series  of  experiments  in  which  depth  scaling  errors  over  a  range  of  ±20%  were  introduced,  PMFE  fan¬ 
outs  showed  a  uniformity  error  of  <  5%.  This  is  much  superior  to  the  performance  of  conventional 
surface  relief  fan-out  elements  for  which  a  uniformity  reduction  of  10%  typically  results  from  a  depth 
scaling  error  of  only  a  few  percent.  Latest  results  will  be  presented  in  the  paper. 


[1]  R.E.  Kunz  and  M.  Rossi.,  "Phase-matched  Fresnel  elements".  Opt  Comm.  97, 6-10  (1993). 

[2]  M.T.  Gale,  M.  Rossi  and  H.  Sc  Witz,  "Fabrication  of  continuous-relief  microoptical  elements  by 
direct  laser  beam  writing  in  photoresist,"  Proc.  SPIE,  2045  (1994). 

This  work  was  supported  in  part  by  the  Swiss  Priority  Program  OPTIQUE. 
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Strategies  for  realising  large  computer  generated  holograms 
from  the  calculation  of  sub  holograms 

by 

R.  L  Young,  A.  EL  MacGregor  and  C.W.  Slinger 

Abstract 

Several  schemes  that  decrease  the  calculation  time  of  a  computer  generated  hologram 
through  utilising  sub  holograms  are  presented.  The  applicability  of  these  schemes  to 
differing  encoding  schemes  and  design  criteria  is  discussed. 

Summary 

The  design  of  computer  generated  holograms  (CGH)  is  an  optimisation  calculation  in  a 
solution  space  given  by  all  the  possible  pixel  configurations  for  the  hologram1. 
Generally  the  assessment  of  the  replay  of  a  CGH  involves  evaluating  all  the  sampling 
points  simultaneously.  This  leads  to  calculations  involving  large  arrays.  Shortening 
the  calculation  time  is  achievable  by  reducing  the  number  of  pixels  in  the  CGH  needing 
revaluation.  One  possibility  for  doing  this  is  to  only  calculate  certain  pixel  values  by  an 
updating  methodology2. 

This  paper  presents  three  strategies  that  allow  Fourier  CGH  design  by  calculating 
sequentially  small  regions  of  solution  space,  i.e.  sub  sections  of  the  desired  hologram. 
The  common  feature  of  these  schemes  is  a  decrease  in  the  time  taken  to  realise  the  final 
hologram  by  reducing  the  size  of  data  requiring  Fourier  Transforming  during  the 
calculation.  Additionally  convergence  time  of  the  algorithm  falls  rapidly  with  decrease 
in  pixel  number.  Of  these  schemes  the  expansive  and  partitioning  methodologies  are 
novel  whilst  the  Interlaced  Technique  (IT)  has  appeared  in  the  literature3. 

The  effects  of  sub  hologram  size  upon  encoding  errors,  replay  fidelity,  convergence 
time  and  number  of  iterations  are  given.  The  manipulation  of  solution  space  is 
demonstrated  for  two  well-known  CGH  design  algorithms,  the  Iterative  Fourier 
Transform  Algorithm  (LFTA)  and  Direct  Binary  Search  (DBS). 

The  applicability  of  manipulating  solution  space  to  CGH  design  involving  differing 
design  criteria  (e.g.  complex  amplitudes  or  intensities)  for  the  evaluation  of  replay 
fidelity  it  also  discussed. 

A  comparison  of  the  performance  of  all  three  methodologies  is  made  with  respect  to 
each  other  and  also  to  the  equivalent  single  large  calculation.  While  the  new  schemes 
all  converge  quicker  than  the  single  calculation,  they  introduce  a  redundancy  of 
information  into  the  CGH  leading  to  poorer  replays.  It  is  shown  how  redundancy 


•  •  •  •  .  •  • 
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•rues  within  these  schemes  and  how  a  further  iterative  design  can  circumvent  some  of 
these  problems. 

From  these  assessments,  possible  cases  and  applications  where  the  manipulation  of 
solution  space  is  beneficial  in  comparison  with  a  angle  calculation  are  given. 

RafenmcM 

1  Digital  holography  as  part  of  diffractive  optics.  Wyrowski  F  &  Bryngdahl,  O. 
Rep  Prog  Phys,  54,  Pg.  1481-1571,  1991. 

2  Efficient  design  of  DBS  CGH.  JennisonBK,  Allebach  JP  &  Sweeney  DW.  J 
Opt  SocAmA,  8(4),  Pg  652-660,  April  1991. 

3  Iterative  interlacing  approach  for  synthesis  of  CGH.  ErsoyOK,  Zhuang  JY  & 
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Optical  Implementation  of  Crossover  Optical  Interconnect  Network 
using  Denman n  Grating  with  66X66  Spot  Arrays 
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(The  National  Laboratory  On  Laser  Technology,  Buaahong  university 
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Abstract 

Danmann  grating  with  66X65  spot  arrays  has  been  optimised  using  SAT-GRG 
optimised  algorithm  and  fabricated.  Crossover  optical  interconnect  network 
of  64X64  pixel  arrays  with  Dammann  grating  has  been  performed. 

1.  Introduction 

Danmann  grating  is  binary  phase  grating  that  can  be  used  to 
generate  equal  intensity  spot  arrays.  111  Resently.  the  interest  in 
Danmann  grating  has  been  stimulated  by  the  development  of  optical 
computing.  In  optical  computing  system,  the  sizes  N  of  arrays  as  high  as  64 
x64,  or  more  are  required.  Due  to  complexity  of  optimisation  in  the  case 
of  large  spot  arrays,  application  of  Dammann  grating  will  be  limited1*3. 

In  order  to  optimize  Danmann  grating, we  present  a  efficient  optimized 
algorithm:  simulated  annealing  -tempering-generalised  reduced  gradient 
algorithm  (SAT-GRG).  Dammann  grating  with  66X65  6pot  arrays  is  fabricated 
and  used  in  crossover  optical  interconnect  network. 

2.  Optimization  and  fabrication  for  Dammann  grating 

The  principle  of  Dammann  grating  is  based  on  Fraunhofer  diffraction 
theory.  Dammann  grating  is  illuminated  by  plane  wave  from  a  laser  source. 
The  output  pattern  appears  in  the  back  focal  plane  of  a  Fourier 
converging  lens.  For  the  optimisation,  we  define  an  objective  function: 

Merit=£  lim(i)-l]a  (l) 

Im(x)  is  the  diffraction  intensity  of  m-th  order,  1  is  the  mean  value  of 
all  intensity.  Simulated  annealing  algor i thm (SA)  is  one  of  optimisation, 
but  its  shortage  is  that  it  requires  large  quantity  of  calculation  for 
large  spot  arrays,  in  order  to  reduce  the  calculation  quantity  and 
enchanee  the  calculation  efficiency,  we  add  a  tempering  process  to  the  SA 
algorithm,  SAT  algorithm  avoids  too  mueh  stay  at  the  local  minimum 
magnitude  of  merit  function.  Furthermore,  on  the  base  of  SAT  algorithm  we 
add  a  generalised  reduced  gradient  algorithm!  GRG)  with  much  higher 
precision  solution.  Vith  SAT-GRG  algorithm,  the  structure  of  66X66  array 
of  Dammann  grating  with  non-uniformity  of  0.4%  and  diffraction  efficiency 
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of  82%  is  obtained.  In  order  to  fabricate  Dammann  grating  with.  66  x  66 
spot  arrays,  the  VLSI  technique  is  used  in  our  experiment,  by  Si*N4  film 
deposition  on  quarts  glass  substarte,  photolithographic  and  reactire  ion 
etching,  the  phase  structure  of  Dammann  grating  is  transfer  onto  the  Si*N4 
film,  the  Dammtnn  grating  of  66X66  spot  arrays  is  achiered. 

3.  Optical  implemention  of  crossorer  network  with  64X64  pixel  arrays 

The  optical  setup  of  one  stage  of  crossorer  network  is  shown  in  fig.  1. 
The  beams  from  laser  is  projected  onto  Daemacc  grating!  D)  through 
combination  lens (ll).  a  mask  with  character  'E*  is  placed  on  the  focal 
olane  of  lens(L2)  where  a  input  pattern  of  character  *E*  with  64  X  64 
oixels  is  produced.  The  beamsplitter  (BS)  split  the  input  pattern  into  two 
diferent  paths.  The  path  E-P2-CCD  with  a  mirror  in  P2  plane  implements 
straight  connection.  The  path  E-Pl-CCD  implements  cross  connection.  The 
output  pattern  of  i-0  stage  for  cross  connection  can  be  achiered  by 
placing  a  reflecting  90°  prism  in  Pi  place.  For  implemention  output 
pattern  of  i-stage  cross  connection,  we  can  place  a  prism  grating  with 
the  period  of  21  in  the  plane  of  Pi.  Using  six  such  setup,  we  can 
complete  crossorer  network  with  64X64  spot  arrays. 

4.  conclusion 

Dammann  grating  with  65X66  spot  arrays  has  beet  optimized  using  SAT 
-GRG  algorithm  and  fabricated. Crossorer  optical  interconnect  network  of  64 
x 64  spot  arrays  with  Dammann  grating  has  been  performed.  This  setup  has 
the  adrantage  of  equal  optical  path  length  and  no  light  energy  loss. 

Reference 

[1]  Dammann  D,  Gortler  K,  Opt.  Coomun. ,  1S70,  3:312-316. 

[2]  Jahns  J.  et  al. ,  Opt.  Eng. ,  1989.  28  (121  :  1267. 
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Fig.  1,  Optical  setup  of  one  stage  of  crossover  network 
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Three  dimensional  distribution  of  light  generation  by  a 

diffractive  element 
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In  many  laser  and  optics  applications  it  is  necessary  to  govern  the  wavefront  (WF) 
propagation  through  long  distances.  These  include  storage  and  display  of  information, 
spatial  distribution  of  energy,  optoelectronic  interconnections,  precision  measurements 
and  alignment.  Fourier  and  Fresnel  computer  generated  holography,  do  not  provide  this 
capability.  Most  of  the  known  techniques  can  be  used  only  to  generate  a  desired  field 
intensity  distribution  over  a  specified  transversal  plane  and  sometimes  on  a  limited  depth 
of  field. 

The  problem,  in  its  most  general  form,  can  be  stated  as  follows:  ‘Given  a  known 
WF  incident  on  a  diffractive  element,  design  this  element  to  obtain  a  desired  intensity 
distribution  within  a  given  three-dimensional  domain.’  It  is  obvious  that  physics  does 
not  allow  a  solution  for  any  arbitrary  distribution  but  it  does  in  many  cases  of  practical 
interest.  Moreover,  if  a  solution  does  not  exist,  it  still  may  be  valuable  to  derive  the 
closest  solution  to  our  constraints.  This  is  clearly  an  optimization  problem  for  which 
many  known  algorithms  can  be  applied.  In  our  approach  we  consider  a  region  of  space 
behind  the  diffractive  element  where  we  want  to  control  the  beam  propagation,  and  in 
this  region  we  impose  the  necessary  constraints  on  sufficiently  close  transversal  planes. 

Consider  a  given  WF  incident  on  a  diffractive  element  at  the  input  plane  of  the  system 
in  Fig.  1  which  then  propagates  in  free  space  a  distance  d\  towards  the  region  D.  Certain 
conditions  on  the  field  intensity  distribution  are  to  be  imposed  in  this  region,  up  to  a 
distance  d2  from  the  input. 
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Figure  1:  The  basic  scheme  for  WF  synthesis  in  the  region  D. 
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The  complex  amplitude  over  a  plane  at  a  distance  z  can  be  described  by  the  Fresnel 
transform, 

F((,V,z)  =  FrT  (f(x,y))  (1) 

The  constraints  prescribed  on  this  plane  can  be  transformed  into  constraints  on  the  field 
at  z  =  0+  and  represented  as  Cx. 

Using  the  iterative  algorithm  of  projections  onto  convex  sets  we  can  associate  with 
each  constraint,  Cx,  a  projection  operator  Vx.  By  definition,  g  =  Vzf  is  the  projection  of 
/  onto  Cx  if 

\\f-j\\  =  xnft€C.\\f-x\\  (2) 

where  ||/||  =  is  the  norm  of  /,  and  ( f,g )  =  / \f\\g\dxdy.  If  Cx  is  closed,  g  €  Cx 

and  if  it  is  convex,  the  projection  is  unique. 

Given  a  function  /  describing  the  input  field,  we  find  its  projection  onto  the  constraint 
Cx  by  calculating  the  field  at  the  distance  z,  projecting  this  function  onto  the  constraints 
of  that  plane  and  finally  performing  the  inverse  FrT  to  obtain  the  corresponding  field  at 
the  input: 


g  =  Vzf  =  FrT-1  {PzFrT  (/)}  (3) 

where  Vz  represents  the  projection  operator  on  the  plane  at  the  distance  z.  This  is  possible 
since  the  FrT  satisfies  Parseval’s  theorem. 

If  we  take  n  planes  we  have  a  set  of  constraints  C  =  Cj...Cn.  In  the  problem  under 
consideration,  a  function  /  is  searched,  which  lies  simultaneously  in  all  sets  C,  ( i  —  l...n), 


i=i 


(4) 


It  is  not  possible  to  perform  the  projection  onto  C  directly,  but  in  the  case  of  convex 
sets  the  problem  can  be  solved  by  a  recursive  process  using  the  composite  operator  T  = 

'Pn'Pn-\...'Pi'. 

fm  =  Tmfo  =  Tfm-i  m  =  0,1,2...  (5) 


where  /o  is  an  arbitrary  initial  function. 

We  assume  that  the  intensity  pattern  does  not  change  substantially  between  adja¬ 
cent  constraint  planes.  Therefore,  the  maximal  local  axial  frequency  must  be  taken  into 
account  to  choose  the  distance  A z  between  these  planes: 


A(z)  =  2ir 


kr2  +  4  irz 


(6) 


Preliminary  experiments  indicate  that  the  proposed  approach  leads  to  efficient  so¬ 
lutions  for  various  problems.  As  examples,  “nondiffracting  beams”  and  “nondiffracting 
beam”  arrays  were  demonstrated.  These  “nondiffracting”  arrays  can  be  useful  for  optical 
interconnections,  precision  alignment  and  measurements. 

The  procedure  adapts  the  design  to  the  available  space  bandwidth  and  to  the  imposed 
constraints.  As  the  method  is  not  restricted  to  incident  plane  waves,  the  design  of  diffrac¬ 
tive  elements  can  be  adapted  to  any  WF  such  as  Gaussian  beams  coming  directly  from 
laser  sources. 
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Abstract 

64  images  of  128x128  pixels  each,  have  been  multiplexed  in  a  same  volume  of  a  BaTi03 
photorefractive  crystal  with  the  deterministic  phase  encoding  technique.  We  compare  the 
performances  of  that  memory  with  the  theoretical  predictions. 

Summary 

We  have  implemented  a  dynamic  holographic  memory  with  a  capacity  of  64  images  1.  During 
the  recording  process,  the  images  are  impressed  on  the  image  beam  with  a  binary  ferroelectric 
modulator  of  128x128  pixels.  The  holograms  are  multiplexed  in  the  same  volume  of  a  BaTi03 
photorefractive  crystal  by  using  the  deterministic  phase  encoding  of  the  reference  beam^.  In  that 
technique,  each  of  the  N= 64  images  is  recorded  by  interfering  the  image  beam  with  all  N= 64 
reference  beams.  The  same  N  reference  beams  are  used  to  record  all  images.  However,  the  set 
of  relative  phases  (0  or  n)  between  the  reference  beams  and  the  image  beam  is  changed  for  each 
image.  It  represents  the  image  address.  These  images  are  reconstructed  by  reading  out  the 
holograms  with  the  whole  set  of  reference  beams  carrying  the  phase  code  corresponding  to  the 
image  one  wants  to  retrieve.  In  theory  if  the  phase  codes  are  well  chosen2.5  (j.e.  if  the 
crosscorrelations  of  the  different  sets  of  reference  beams  are  zero)  the  images  can  be  retrieved 
without  any  crosstalk.  Experimentally  a  given  amount  of  crosstalk  arises  from  imperfections  of 
optical  components  and  especially  from  the  non  equal  intensities  of  the  reference  beams  and 
from  imperfect  0-7t  phase  shifts. 

We  have  optimized  our  set  up  trying  to  reduce  these  sources  of  crosstalk  as  follows.  The  64 
reference  beams  of  equal  intensities  are  produced  by  splitting  a  single  beam  into  64  beamlets 
with  a  computer  generated  fan  out  hologram.  It  was  realized  by  King's  College  London  in  the 
frame  of  the  POP  AM  project^.  All  intensities  are  equal  within  0=8%.  These  64  beams  are 
phase  modulated  by  the  64  pixels  of  a  spatial  light  modulator.  In  order  to  achieve  very  accurate 
0-7i  phase  shifts,  we  implemented  the  phase  modulation  from  a  polarization  modulator^.  We 
utilized  a  binary  ferroelectric  liquid  crystal  SLM  acting  as  an  array  of  64  half  wave  plates.  The 
input  beams  are  linearly  polarized  along  die  bisector  of  the  two  possible  states  of  the  fast  axis  of 
the  liquid  crystal.  After  passing  through  it,  the  beams  have  one  of  the  two  possible  linear 
polarizations.  An  output  polarizer  set  at  90°  of  the  input  polarization  state  (i.e.  at  90°  of  the 
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bisector  of  the  two  output  polarization  states),  transforms  the  polarization  modulator  into  a 
phase  modulator.  Because  the  polarizer  can  be  precisely  adjusted,  an  exact  phase  shift  is 
obtained.  In  our  set  up,  the  phase  shifts  are  equal  to  O-jt  with  an  error  less  than  6.1  O'4  radian. 
The  fast  random  access  time  of  this  modulator  allows  to  retrieve  any  of  the  images  in  150ps. 

We  completely  characterized  our  set  up.  For  instance,  we  found  that  the  crosstalk  in  one  image 
due  to  all  other  63  images  is  always  lower  than  1:20.  Examples  of  images  transmitted  by  the 
crystal  before  storage  and  the  same  images  retrieved  after  storage  are  shown  in  the  figure. 

We  will  discuss  in  more  details  the  influence  of  set  up  imperfections  on  the  memory  capacity  by 
comparison  of  theory  and  experiments  conducted  on  our  set  up  on  which  we  have  intentionally 
added  some  defects. 


Transmitted  (left)  before  storage  and  retrieved  (right)  images  (2  among  64). 
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Abstract 

A  intense  photostimulated  luminescence  (PSL)  at  about  420 
nm  was  observed  when  the  UV-light  irradiated  Eu-doped  KC1 
(KCl:Eu)  phosphor  was  stimulated  with  580  nm  light  at  room 
temperature  (RT)  .  The  PSL  phenomenon  in  KCl:Eu  phosphor  is 
applicable  to  an  erasable  and  rewritable  optical  memory. 


A  new  type  of  optical  memory  based  on  the  photostimulated 
luminescence  (PSL)  phenomenon  in  electron  trapping  phosphor 
materials  for  optical  storage  have  been  studied  in  the  fields 
of  optical  associative  memory,  optical  parallel  Boolean  logic 
operations  and  optical  neural  networks.  The  electron  trapping 
phosphor  materials  can  emit  different  output  photons  that 
correlate  spatially  in  intensity  with  input  photons.’  2) 
Consequently,  the  phosphor  materials  can  be  used  to  store 
optical  information  as  trapped  electrons  and  the  information 
stored  can  be  read  out  by  a  laser  beam  scanning  of  the 
phosphor  materials.  The  unique  features  of  the  electron 
trapping  phosphor  materials  that  exhibit  the  PSL  phenomenon 
provide  the  potential  for  high  bit  storage  densities,  high 
data  transfer  and  fast  recovery  speeds.  Important 
characteristics  of  a  good  electron  trapping  phosphor  materials 
for  optical  memory  are  high  PSL  brightness  for  low  noise, 
short  luminescence  lifetime  for  minimum  readout  time  and  low 
light  scattering  for  high  bit  storage  densities.  Especially, 
the  electron  trapping  phosphor  materials  using  transparent 
single  crystal  or  thin  film  provide  an  efficient  PSL  and  low 
light  scattering. 

As  a  results  of  surveying  many  possible  transparent 
phosphor  materials  such  as  alkali  halide,  II  —  VI  compound  and 
oxide  phosphors  in  order  to  obtain  a  novel  electron  trapping 
phosphor  material  with  high  PSL  brightness  and  low  light 
scattering,  we  found  that  transparent  Eu-doped  potassium 
chloride  (KCl:Eu)  crystals  exhibit  an  efficient  PSL  for 
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optical  stimulation  with  visible  light  after  ultraviolet  (UV) - 
light  excitation.  In  this  work,  we  report  PSL  characteristics 
of  KCl:Eu  storage  phosphor  for  optical  memory. 

An  intense  PSL  peak  at  about  420  nm  was  observed  when  the 
UV-light  irradiated  sample  was  stimulated  with  560  nm  light  at 
room  temperature  (RT) .  A  typical  PSL  emission  spectrum  (solid 
line)  as  well  as  stimulation  spectrum  (dashed  line)  for  the 
420  nm  PSL  peak  from  240  nm  UV-ray  irradiated  sample  is  shown 
in  Fig.1.  Excitation  spectrum  (dotted  line)  for  the  420  nm 
PSL  peak  is  also  shown  in  Fig.1.  Using  the  PSL  phenomenon  in 
KC1 : Eu  phosphor,  stored  UV  (KrF  laser)  write-in  information 
can  be  read-out  using  visible-emitting  laser  such  as  He-Ne  or 
Ar+  ion  laser. 
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Fig.  1 
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We  have  developed  a  new  holographic  associative  memory  (HAM)  that  uses  learning 
patterns  derived  from  an  adaptive  learning  rule.  The  principle  of  learning  pattern  method 
(LPM)  and  simulation  result  cf  the  present  HAM  are  presented. 


1.  Introduction 

Although  the  optical  implementation  of  an 
associative  memory  such  as  the  Hopfield  model  can 
be  done  easily  with  vector-matrix  multiplications1  or 
holographic  systems2"3,  the  performance  may  be 
limited  due  to  the  practical  difficulties  of  having 
pseudo-orthogonality  in  memory  patterns  and  the 
existence  of  undesirable  stable  states.  The  LPM 
utilizes  the  high  performance  of  adaptive  learning  by 
using  the  simple  outer-product  learning 
implementation  in  HAM. 

2.  Principle  of  LPM 

Let  bm  be  the  m-th  memory  pattern  to  be  stored,  then 
the  error  function  E  is  defined  as 

<0 

^  «  ij 

where  om and  tm  is  the  actual  output  state  and  its 
desired  state,  respectively. 
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We  have  adopted  gradient-descent  algorithm  to  get 
the  optimal  interconnection  weight  W. 

m  sw„u 

(4) 

The  above  algorithm  is  modified  so  as  to  obtain  the 


Fig.  I  Three  two-dimensional  memory  patterns 


interconnection  matrix  of  discrete  and  positive 
elements.3  Then  the  matrix  can  be  expressed  as  the 
liner  combination  of  new  matrices  Ts  as  follows: 

r=f>,r=fxu'(v’,r  (s) 

*=i  i=i 

where  as  is  the  constant  coefficient  and  the  matrix 
7*  is  the  outer  product  between  learning  vectors 
usand  vs  with  binary  elements  (1,0). 


3.  Holographic  Implementation  of  LPM 

The  three  two-dimensional  binary  patterns  L,  P,  and 
M  are  selected  as  the  memory  patterns  as  shown  in 
Fig.  1 ,  and  they  can  be  written  in  the  form  of  25(5x5)- 
bits  vectors  as  follows: 

(L) :  b]  =(1000010000100001000011111), 

(P) :  b2  = (1 1 1 1010001 1 1 1 101000010000), 

(M) :  63  =  (10001 1 101 110101 10001 10001). 


2  1  1  1  02000  1  2  II  I  0200002  I  11  I 
0333000000030300000000000 
0333000000030300000000000 
0333000000030300000000000 
0000303030000030000300000 
21  I  102000121  I  10  2  000021  1  I  I 
0000303030000030000300000 

ooooooooooooooooooooooooo 

0000303030000030000300000 
0222202024024220000200000 
2  111  020001  2  1  1  1  0200002  I  1 1 1 
0333000000030300000000000 
0222202024024220000200000 
0333000000030300000000000 
0000303030000030000300000 
21  I  1  020001  21  1  102000021  I  I  1 

ooooooooooooooooooooooooo 

ooooooooooooooooooooooooo 

ooooooooooooooooooooooooo 

0000303030000030000300000 
21  I  102000  1  21  I  102000021  1  l  I 
0000000000000000000005550 
0000000000000000000005550 
0000000000000000000005550 
0000202020000020000203335 

Fig.  2  The  interconnection  matrix  W  obtained 
from  the  adaptive  learning  rule. 
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a,  =  I  a2  =  1  a3  =  3  a4  =  3  a5  =  2  a6  =  2  a7  =  5  ag  =  2  a9  =  1  aia  =  2 
Fig.  3  Ten  learning  pattern  pairs  (vs,u  /  and  their  coefficients  as  (s=l,2 . 10). 


Figure  2  illustrate  the  25x25  interconnection  matrix 
obtained  by  using  the  algorithm  in  Sec.  2.  We  can 
derive  ten  pairs  of  learning  pattern  pairs  (vs,us)  and 
their  coefficients  as  out  of  the  matrix  W  after  simple 
algebra  (Fig.3).  As  shown  in  Fig.  4,  the  interference 
patterns  between  the  collimated  beam  passing 
through  the  pattern  vs  and  the  scattered  beam  from 
the  pattern  us  and  ground  glass  construct 
holographically  the  outer-product  between  vs  and 
i:*.M  The  coefficients  as  controls  the  exposure  time. 
The  energy  function  U  is  defined  as  follows: 

U  =  (6) 

1  IJ  kl 

where  N0  is  the  number  l's  of  input  6 'and  X  is  a 
constant.  It  is  important  to  use  appropriate  value  of 
X.  because  the  stable  state  of  energy  and  the  input- 
dependent  threshold  level  depend  on  it.  Figure  5 
shows  the  simulation  results  of  the  LPM  for  the 
various  values  of  X.  The  recognition  probability  is 
defined  as  the  number  of  correction  recognition  per 
total  number  of  the  inputs.  The  simulations  were 
repeated  over  300  randomly  generated  inputs  of  each 
Hamming  distance  (0-15),  and  the  results  are 
averaged  for  the  three  memory  vectors  L,  P,  and  M. 
We  could  get  best  results  in  the  range  of 
0.6<X<0.7.  Figure  6  represents  one  of  the 
correctly  reconstructed  output.  In  summary,  all 
memory  patterns  can  be  stored  in  stable  state  when 
we  use  these  learning  patterns  using  LPM. 
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Fig.  4  Schematic  diagram  of  recording  the  outer- 
product  of  learning  patterns  holographically. 
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Fig.  5  Recognition  probability  versus  Hamming  distant 
of  input  patterns  with  various  constant  scale  factors  X. 
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Fig.  6  A  result  of  recognizing  the  erroneous 
input  pattern. 
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Abstract 

An  optical  associative  memory  with  bipolar  edge-enhanced  feature  learning 
by  using  a  ferroelectric  liquid  crystal  spatial  light  modulator  and  a  barium 
titanate  crystal  is  presented.  A  high  discrimination  ability  of  input  stimuli 
is  achieved. 


SUMMARY 

In  an  associative  memory  the  recalled  output  can  be  understood  as  the  weighted  sum¬ 
mation  over  the  stored  patterns.  The  weight  is  determined  by  the  similarity  of  the 
input  stimulus  and  the  stored  patterns.  To  achieve  a  better  association  we  increase 
the  wanted  weight  and  suppress  the  unwanted  ones.  We  can  find  the  same  objective 
in  optical  correlation  pattern  recognition  where  we  seek  better  discrimination  ability 
of  a  correlator.  Therefore  techniques  used  in  pattern  recognition  can  be  applied  to 
the  optical  implementation  of  associative  memories.  The  phase-only  filter  (POF)  has 
been  known  for  its  optimal  discrimination.  Investigations  revealed  that  the  magnitude 
squared  of  the  impulse  response  of  the  optical  correlation  system  using  a  POF  is  the 
edge  outline  of  the  original  reference  and  the  histogram  of  the  impulse  response  consists 
of  both  positive  and  negative  values;  the  positive  part  lies  predominantly  inside  the 
outline  while  the  negative  part  lies  predominantly  outside  the  outline.  According  to 
this  property  of  the  POF  filter  we  can  produce  ternarily  valued  edge-enhanced  versions 
of  the  patterns  to  be  stored  by  the  associative  memory.  In  the  bipolar  edge-enhanced 
version  of  a  pattern,  pixels  are  assigned  values  of  1  and  —1,  respectively,  if  their  po¬ 
sitions  in  the  original  pattern  are  adjacent  to  the  outline  of  the  object  and  are  inside 
and  outside  the  object,  respectively,  while  the  other  pixels  are  given  the  value  of  0. 
The  bipolar  edge-enhanced  pattern  has  nearly  equally  distributed  positive  and  negative 
pixels  besides  the  zero  valued  pixels,  which  meet  the  equal-distribution  requirement  for 
an  associative  memory. 

We  set  up  sun  optical  associative  memory  which  uses  a  barium-titanate  photorefrac- 
tive  crystal  as  the  memory  recording  medium,  a  ferroelectric  SLM  as  the  programmable 
input  device  and  a  charge  coupled  device  camera  as  the  output  detector.  The  use  of  a 
photorefractive  crystal  enables  the  interconnect  hologram  to  be  recorded  into  the  mate¬ 
rial  in  real-time,  which  also  introduces  reconfigurability  into  the  system.  The  use  of  the 
SLM  permits  a  programmable  system.  A  33  x  33  fanout  computer  generated  hologram 
is  used  to  project  the  laser  beam  onto  the  SLM.  The  use  of  a  CGH  in  this  experiment  is 
two-folded.  One  is  to  pixelate  the  pattern  displayed  on  the  SLM  in  order  to  increase  the 
displaying  contrast  ratio.  The  other  purpose  is  to  take  the  function  of  a  ground  glass 
as  used  in  some  reports.  In  the  optical  system  performing  the  bipolar  edge-enhanced 
feature  learning,  both  the  original  patterns  and  their  bipolar  edge-enhanced  versions 
are  displayed  on  a  binary  SLM  with  the  edge  enhanced  and  the  original  corresponding 
to  the  input  and  output,  respectively.  The  ternary  modulation  is  realized  in  the  fol¬ 
lowing  way:  Three  pictures,  A,  B  and  B’,  respectively,  are  simultaneously  displayed  on 
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the  SLM.  The  fan-out  beams  of  the  CGH  illuminate  patterns  A  and  B.  Beams  passing 
through  B  will  be  reflected  and  coinddedly  imaged  pixel-wise  onto  B’.  Polarizers  are  so 
placed  that  light  passing  through  A  and  B  is  binarily  amplitude  modulated  while  light 
passing  back  through  B’  is  binarily  phase  modulated.  Therefore  light  passing  through 
both  B  and  B’  is  ternarily  modulated.  The  light  interferes  at  the  crystal  and  forms 
a  volume  hologram  within  it.  Patterns  are  memorized  by  being  input  into  the  SLM 
in  turn.  For  optical  demonstration,  flve  13  x  13-pixel  Chinese  characters,  four  as  the 
memory  patterns  and  one  as  the  false  input  stimulus  are  chosen. 

For  associative  recall  of  an  original  pattern,  its  incomplete  version  or  the  edge- 
enhanced  version  can  be  used  as  the  input  stimulus,  in  place  of  B  or  B\  The  weights  of 
the  weighted  output  are  determined  by  the  inner  products  of  the  input  stimulus  and  the 
corresponding  edge-enhanced  patterns.  If  one  of  the  stored  memories  has  the  largest 
similarity  with  the  input  the  weight  for  that  stored  pattern  will  be  much  larger  than 
the  other  weights.  Hence  high  discrimination  amongst  the  stored  patterns  is  achieved 
and  then  the  association  performance  is  improved.  We  have  calculated  the  weights 
for  the  original  bipolar  Hopfleld  model  (BH),  where  the  input  stimulus  takes  bipolar 
binary  form,  and  the  corresponding  values  for  the  bipolar  edge  enhanced  model  (EE). 
An  average  improvement  of  discrimination  from  73%  (BH)  to  80%  (EE)  is  achieved. 
Here  discrimination  is  defined  as  the  average  ratio  of  difference  in  the  weights  of  an 
expected  memory  and  an  unexpected  memory  to  the  weight  of  the  expected  memory 
which  is  used  as  the  input  stimulus.  It  is  equivalent  to  a  high  order  nonlinear  system 
with  an  order  of  1.3.  There  is  a  dramatic  drop  in  the  output  energy,  normalized  to  the 
input  energy,  when  a  nonstored  pattern  is  input  into  the  system.  This  implies  that  the 
system  has  a  strong  discrimination  among  the  stored  and  nonstored  patterns.  It  also 
suggests  that  this  phenomenon  can  be  used  as  a  measure  to  decide  whether  the  input 
is  a  memory  of  the  system.  Partial  information  addressability  is  also  investigated.  The 
results  show  that  the  system  can  also  perform  the  partial  association  except  for  a  slight 
decrease  of  the  output  signal  to  noise  ratio.  When  the  edge-enhanced  versions  are  used 
as  the  input  stimuli  the  output  is  better  than  in  the  first  two  cases. 

We  also  perform  a  computer  simulation  to  evaluate  the  retrieval  error  of  the  pro¬ 
posed  learning  algorithm.  Most  of  the  performance  evaluation  of  a  neural  network  is 
statistically  carried  out  by  computer  simulation  where  the  stored  patterns  are  randomly 
generated.  But  a  random  pattern  might  be  meaningless  in  practical  applications,  where 
most  of  the  patterns  involved  are  obtained  from  the  real  world.  So  the  storage  capacity 
will  be  lower  than  the  simulation  prediction.  Here  we  choose  the  frequently  used  Chinese 
character  set,  which  consists  of  3755  characters  in  a  16  x  16  bitmap  format.  We  simulate 
the  relationship  between  the  retrieval  error  averaged  over  200  independent  simulations 
and  the  number  of  the  stored  patterns,  where  stored  patterns  were  selected  randomly 
from  the  3755  character  library.  We  calculate  the  cases  for  the  Hopfield  model  (HM), 
its  bipolar  counterpart,  the  BH,  and  EE.  Computer  simulations  show  the  retrieval  error 
for  EE  is  much  smaller  than  those  of  HM  and  BH.  When  the  stored  pattern  number  is 
9,  the  error  for  the  three  are  6.99%  (EE),  17.8%  (BH)  and  33.5%  (HM),  respectively. 
We  also  test  a  modified  EE  algorithm  (ME),  where  the  edge-enhancement  is  so  carried 
out  that  for  a  1-valued  pixel  its  value  will  be  reset  to  be  the  number  of  its  0-valued 
nearest  neighbour  pixels,  and  for  a  0-valued  pixel  its  value  will  be  reset  to  the  negative 
of  its  1- valued  nearest  neighbour  pixels.  With  the  same  simulation  as  above  we  find 
the  corresponding  retrieval  error  is  now  0.732%. 
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Self-routing  using  an  Pulse  Interval  Coded  Optical  Content  Addressable  Memory 

Subrat  Kar* 

Abstract 

At  multi-gigabaud  data  rates,  the  unacceptable  overheads  of  O/E-EJO  conversion 
may  be  avoided  by  self-routing  of  optical  data  streams.  We  discuss  a  scheme  by 
which  data  may  be  self-routed  through  a  optical  switch  using  an  optical  content 
addressable  memory.  The  integrated  optic  realization  of  this  element  and  its 
application  to  a  synchronous  time  slot  interchanger  are  also  discussed. 

Introduction 

A  novel  self-routed  slot-switching  scheme  using  an  integrated  optic  directional  coupler  is 
outlined  using  a  pulse  interval  coded  optical  content  addressable  memory  (PICOCAM).  The 
integrated  optic  devices  required  for  its  implementation  are  also  outlined.  This  bit-switching 
approach  avoids  die  bottleneck  in  speed  imposed  by  the  electronic  drive  circuitry  in  electn  >tic 
switches.The  scheme  achieves  bit-wise  self-routing  in  a  2x2  two-wavelength  c  i 
multiplexing  photonic  element  -  an  integrated  optic  directional  coupler  (IODC)  or  the  o- 
mode  interferometer  (TMI)  structure. 

Principle 

We  use  a  four-port  (2x2)  optical  switching  device  which  switches  its  inputs  as  a  function  of 
their  wavelength.  Thus,  input  data  presented  on  one  of  two  wavelengths  A.]  and  X2  would  cross 
over  to  the  other  waveguide  if  presented  on  wavelength  X.1  and  would  remain  in  the  same 
waveguide  if  present  on  wavelength  A2.  Therefore,  assuming  broadcast  states  as  forbidden,  the 
two-state  functionality  of  the  switch  is  preserved.  By  avoiding  the  associated  drive  circuitry,  the 
scheme  allows  data  throughput  at  near  optical  rates.  Further,  the  switching  scheme  is  inherently 
real-time  i.e.  the  response  time  of  the  switch  is  less  than  the  data  bit  width.  The  responsibility  of 
presenting  the  data  to  the  switch  with  the  appropriate  wavelength  now  devolves  on  the  input  - 
the  data  must  be  recognized  and  converted  to  the  appropriate  wavelength  before  it  is  input  to 
the  switch. 

We  assert  that,  in  any  non-faulty  rearrangeably  non-blocking  PSA,  if  the  routing  architecture 
is  known,  the  set  of  connections  T|  between  any  set  of  inputs  and  outputs  is  deterministic. 
Further,  this  assertion  implies  that  if  In  is  the  set  of  inputs  and  On  is  the  set  of  outputs,  then  for 
all  tie  In  x  On ,  the  set  of  states  S*  of  every  switch  on  is  known.  For  instance,  in  a  3x3  switch 
with  two-state  switches  (through/cross)  the  number  of  possible  connections  are  given  by  the 
following  six  sets:  SI  =  (0, 0),  (1, 1),  (2, 2);  S2  =  (0, 1),  (1, 0),  (2, 2);  S3  =  (0, 2),  (1, 0),  (2, 1); 
S4  =  (0, 0),  (U),  (2,  1);  S5  =  (0, 1),  (1,  2),  (2, 0);  S6  =  (0,  2),  (1,  1),(2, 0)  We  note  that,  for 
some  connections  (represented  by  the  set  Sy),  the  switch  is  in  the  through  state  while,  for  other 
connections  (  represented  by  the  set  Sx),  it  is  in  the  cross  state.  Furthermore,  for  each  switch, 
the  sets  Sy  and  Sx  are  predetermined  and  Sy  =  {In  e  Oj, )  \  Sx  In  switch  1,  for  instance,  Sy 
=  (SI,  S2,  S3,  S4  }  and  SX  =  {S5,  S61. 
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The  routing  function  can  be  obtained  by  the  considering  each  switch  as  a  optical  content 
addressable  memory  (CAM)  whose  output  is  1  if  it  contains  the  required  data  and  0  (null)  if  it 
does  not.  The  recognition  of  the  data  at  the  switch  is  done  using  a  pulse-interval  coded  optical 
CAM  (PICOCAM).  The  set  of  possible  destinations  D  ( i.e.  two  sets  of  destinations  for  a  single 
switch )  of  a  data  bit  is  split  into  two  sets  Sj  and  Sx  as  shown  in  the  table  below. 


Switch  No. 

ST 

SX 

I 

1,2, 3, 4 

5.6 

2 

1.4 

2,3,5,6 

3 

1,2,5 

3.4,6 

A  data  bit  bound  for  a  destination  d  €  Sj  (Case  I)  requires  the  PSE  to  be  in  the  through  state 
and  must  be  presented  on  wavelength  Xj.  Similarly,  a  data  bit  going  to  a  destination  d  e  Sx 
(Case  II)  requires  the  PSE  to  be  in  the  cross  state  and  must  be  presented  on  a  wavelength  A-2-  I 

Each  data  bit  is  encoded  in  the  PICOCAM  format  as  follows:  each  bit  is  translated  into  an 
equivalent  header  consisting  of  n+1  slots  (So,...,Sn)  of  duration  T  each  where  n  is  the  number 
of  destinations  with  a  guard  delay  E  between  each  slot  (for  a  single  switch,  n  =  2). 

We  discuss  the  realization  of  this  CAM  structure  using  fiber  optic  delay  lines.  Also  the 
monolithic  integration  of  this  device  onto  a  Lithium  Niobate  substrate  is  discussed.  The  ( 

calculated  data  rate,  therefore,  is  determined  by  the  largest  pulse  interval  ~  for  the  fiber  optic 
delay  line  structure  we  discuss,  this  is  506  picoseconds  -  which  corresponds  to  a  fixed  data 
throughput  of  1.97x10^  Hz  or  nearly  2  Ghz.  Since  this  routing  scheme  requires  no  switching  in 
the  electronic  domain,  the  throughput  expected  is  high  and  at  the  speeds  defined  largely  by  the 
optical  inputs  and  the  response  times  of  the  opto-electronic  components  used.  1 

As  an  application  of  the  PICOCAM  element,  we  also  discuss  the  implementation  of  a  self¬ 
routing  synchronous  time  slot  interchanger  based  on  the  PICOCAM  element  —  a  stream  of  time 
slots  is  manipulated  to  achieve  an  interchange  of  time  slots  in  such  a  way  that  the  i*  input  slot 
occupies  the  jfi1  output  slot  if  input  i  is  to  exchange  data  with  output  j.  The  destination  of  each 
slot  is  slot-coded  into  a  part  of  the  slot  using  spread  spectrum  codes  with  sorting  achieved 
using  coincidental  pulse  techniques.  A  suggestion  for  an  all-optical  implementation  of  the 
proposed  scheme  is  also  given. 


The  PICOCAM  element 
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Use  of  Multiplexing  Diffraction  Gratings  and  Holographic  Lens 
Arrays  in  Optical  Associative  Meaories 
V.F.Yarsolitskij,  A. I. Bogdanovich,  A.M.Polikanin,  V.I.  Polyakov 
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ABSTRACT.  The  paper  discusses  the  design  and  operation  of  an  optical 
associative  seaory  (0AM)  in  which  effective  optical  coupling  between 
aeaory  elements  is  realized  by  using  phase  diffraction  gratings  and 
holographic  lens  arrays  sade  by  specially  developed  technology. 
Experimental  results  are  presented  corroborate  a  promising  potential  of 
the  OAM  variant  under  consideration. 

One  of  the  problems  arising  in  implementing  an  optical  associative  memory 
(OAM)  structure  concerns  provision  of  an  effective  illumination  of  light- 
modulating  and  light-recording  elements  of  the  OAM  11 J.  This  study  concen¬ 
trates  upon  the  possibility  of  solving  the  above-mentioned  problem  by 
using  a  set  of  diffraction  gratings  (DG)  and  2-D  holographic  lens  arrays 
(HLA)  as  multiplexing  and  demultiplexing  elements  of  the  OAM. 

The  following  figure  illustrates  an  optical  circuit  of  the  OAM. 


llj  L13 


In  associative  search  of  information,  a  laser  beam  is  split  by  a  DGj 
into  n  rays  of  equal  intensities  and  illuminates  a  linear  modulator 
array  (LMA)  having  n  channels.  Then,  the  search-word  nodulated  rays  are 
divided  by  the  grating  DG2  into  a  rays  in  orthogonal  plane.  The  raster 
of  n  x  a  rays  (where  n  is  the  number  of  paraphasing  bits  search  words, 
a  is  the  number  of  search  words  stored  in  the  memory)  simultaneously 
illuminates  all  cells  of  an  operative  recording  medium  (0RM).  The  search 
word  code  is  compared  with  the  corresponding  inverted  code  of  the  searched 
data  in  accordance  with  the  total  mismatch  principle,  and  the  result  of 
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comparison  is  projected  to  a  linear  photodetector  array  (LPA)  by  seans  of 
an  HLA  consisting  of  n  x  a  holographic  lenses. 

The  peculiarity  of  the  optical  circuit  proposed  is  that  it  ensures  te- 
lecentrical  path  of  rays  which  minimizes  abberrations  and  gives  the 
possibility  of  arranging  active  elements  (LMA,  ORM,  LPA)  in  the  regions 
of  laser  beam  waists,  resulting  in  lower  circuit  sensitivity  to  distor¬ 
tions  in  the  light  raster  and  reducing  crosstalk. 

In  the  prototype  version  of  the  OAM,  we  used  phase  diffraction  gratings 
made  of  dichromated  gelatin  (DCG)  layers  by  a  specially  developed  method 
using  multiple  contact  replication  of  metallized  master  grating  with  20  Mm 
spacing  and  rectangular  slits.  Owing  to  nonlinearity-build  up  effect,  non- 
uniformity  of  the  energy  distribution  over  diffraction  orders  of  the 
gratings  was  within  about  10X  up  to  the  i  5th  order,  the  energy  efficiency 
of  the  grating  being  80%.  Maximum  deviation  from  equidistant  position  of 
diffraction  maxima  was  not  greater  than  4%.  The  holographic  lens  arrays 

used  in  the  prototype  was  made  on  DCG  layers  in  laboratory  conditions.  The 
2 

array  contained  10  holographic  lenses  with  average  diffraction  efficien¬ 
cy  of  60%  and  the  diffraction  efficiency  nonuniformity  over  the  array 
field  of  5%. 

Taking  into  account  losses  in  multichannel  DKDP-based  light  modulators 
12 j  used  as  LMA  and  ORM,  the  integrated  light  efficiency  of  the  proposed 
scheme  was  about  35%,  which,  in  our  opinion,  makes  it  promising  for  use  in 
optical  associative  memory  structures. 
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ABSTRACT 

Reversal  input  superimposing  technique  is  applied  to  an  optical  learning  neural  network.  Optical  neural 
networks  introduced  the  technique  are  not  necessary  to  use  negative  weights  and  subtraction,  and 
inherently  constructed  all  optical  systems. 

1.  Introduction 

In  general  neural  network  models,  the  synaptic  weights  and  the  weighted  sum  are  the  finite  real  values. 
It  means  that  some  of  them  have  minus  values.  In  die  other  hand,  to  implement  optical  neural  networks, 
usually  the  output  of  neurons  and  the  values  of  weights  are  represented  by  optical  intensities,  transmittances  or 
reflectivities.  It  is  hard  to  realize  minus  values  under  these  representations.  Therefor,  in  some  optical  neural 
network  systems,  the  twice  weights,  separated  plus  and  minus  weights  are  used.  After  each  weighted  sum 
operation,  these  values  are  subtracted  by  electronic  circuits[l,2].  In  this  approach,  twice  pixels  of  weight  matrix 
and  electronic  subtract  circuits  are  required  aid  the  system  speed  is  limited  by  these  circuits.  In  another  case,  the 
weights  are  added  with  a  bias  and  the  thresholds  of  each  neuron  are  changed  according  to  the  total  values  of 
inputs.  In  this  case  special  hardwares  to  realize  controlling  the  each  threshold  are  needed,  and  an  all  optical 
system  is  impossible. 

We  proposed  the  reversal  input  superimposing  technique  (RIST)  to  avoid  above  problems! 3, 4],  In  this 
paper,  we  report  an  optical  learning  neural  network  with  RIST.  The  optoelectronic  neural  system  based  on  RIST 
is  implemented  and  learning  capability  is  realized  by  using  a  Pockels  readout  optical  modulator  (PROM) 
devke[5]. 


2.  Principle  of  RIST 

Consider  general  discrete  neural  network  models  consisting  of  M  neurons,  which  receive  the  same  input 

signals  X  =  {xl,...fxr..,xH}  from  N  input  neurons,  and  emit  respective  output  signals  V  =  {v, . vf...,vM).  Let 

Wj,  be  the  synaptic  weight  from  the  i-th  input  neuron  to  j-th  output  neuron.  The  j-th  weighted  sum  of  input 
signals  u{  and  the  output  vjare  written  as 


and 


ui  =  Wi  *  hj  0) 

Vj-Kuj),  (2) 


where  hj  is  the  j-th  value  of  a  set  of  the  thresholding  values  H  =  (h, . hj . h^}  and  f  is  a  nonlinear  output 

function.  Introducing  a  constant  a,  Eqs.(l)  and  (2)  are  rewritten  as 


u'j  ■  I^Wjj  +  ojx,  +  alfi  -  x,)  (3) 

Vj  *  f  (u’j),  (4) 

where  f(y)  *  ify-Na).  The  thresholding  values  H  *  {hj}  are  neglected  because  they  can  be  treated  as  same  as  the 
synaptic  weights.  The  constant  a  is  a  bias  of  the  weights  w,and  is  determined  to  a  >  -minfWj),  where  minty ) 
denotes  the  minimum  values  erf  a  set  of  the  weights  W  =  {wJf}.  Hence,  the  biased  synaptic  weights  {wj(+a}  of 
the  first  term  in  Eq.(3)  are  necessary  to  be  positive.  The  input  x,  is  ranged  from  0  to  1  in  many  neuron  models. 
In  these  case,  (1-xj  of  second  term  in  Eq.(3)  are  the  reversal  values  of  the  inputs  x,  in  the  range  of  0  to  1,  and 
positive.  Under  these  conditions  all  terms  of  Eq.(3)  are  all  positive  and  easily  realized  by  optical  techniques,  f  is 


i 
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the  non  linear  function  with  die  thresholding  value  which  is  slid  the  constant  Not  from  f  in  Eq(2). 


9 .  Optical  Implementation  and  Experiments 

The  experimental  system  of  an  optoelectronic  neural  network  is  shown  in  Fig.l.  He-Nc  laser  light 
(633nm)  is  used  at  the  recalling  process.  Input  signals  of  neurons  are  polarization  encoded  by  a  liquid  crystal  TV 
(LCTV)  panel  which  has  not  an  analyzer.  The  upper  path  (PATH1)  of  the  system  performs  a  weighted  sum 
operation  which  is  the  first  term  of  Eq.(3)  with  input  signals.  The  lower  path  (PATH2)  realizes  reversal  inputs 
of  the  second  term.  And  these  calculated  results  are  superimposed  and  detected  by  a  CCD  camera.  The  balance  of 
these  signals  are  adjusted  by  the  angel  of  a  half  wave  plate  in  PATH2.  To  achieve  the  learning  capability  on  the 
system,  a  **ROM  device  is  located  in  PATH1.  The  PROM  is  written  or  erased  by  white  light  according  to  the 
values  which  displayed  on  the  LCTV.  Therefor  it  is  not  necessary  to  align  the  weight  matrix  on  the  PROM  and 
input  signals  on  the  LCTV.  The  modification  values  of  the  weights  are  calculated  from  the  teaching  signals  and 
the  recalling  results  by  a 


computer. 

On  the  experimental 
system,  8  input  and  8  output 
neurons  system  is  realized.  3 
input  patterns  of  8  bits  are 
stored  by  the  system.  The 
weights  on  the  PROM  start 
from  random  values  nearby 
the  constant  a  which  is  the 
half  value  of  the  dynamic 
range  of  the  device.  The 
learning  curves  are  shown 
in  Fig.2.  By  13  iterations, 
the  weights  went  to  right 
values  and  the  error  went  down 
to  0. 


4.  Conclusion 

Reversal  input  superimposing  technique 
is  applied  to  an  optical  learning  neural  network. 
Optical  neural  networks  introduced  the  technique 
are  not  necessary  to  use  negative  weights  and 
subtraction.  The  learning  capability  was  applied  to 
the  optoelectronic  system  using  a  PROM  device 
and  verified  at  the  performance  with  an  8-8  neurons 
network. 

The  authors  thank  S.Ishihara,  H.Yajima 
and  T-Hidaka  of  Electrotechnical  Laboratory  for 
useful  discussions  and  encouragements.  Thanks  are 
also  due  to  Y.Osugi  of  NGK  Insulators,  Ltd.  for 
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Abstract 

For  the  first  time  real-time  optoelectronic  fuzzy  control  of  an  inverted  pendulum  is  realized.  The  system 
uses  arithmetic  product -sum -gravity  method  with  Gaussian  -like  membership  functions.  Membership  functions 
and  center  -  of-  gra  vity  operations  are  realized  using  LEDs  and  position -sensing -devices. 

lJntroduction 

High-speed  optoelectronic  analog  fuzzy  inference  technique  has  been  proposedfl]  using 
beam-scanning  laser  diodes  (BSLDs)[2]  and  position-sensing-devices  (PSDs).  The  inference  process 
uses  an  arithmetic  PRODUCT-SUM  Gravity  method  with  Gaussian  membership  functions  and  the 
controllability  of  the  inference  method  is  better  than  conventional  MIN-MAX  Gravity  method  with 
triangle  membership  functions[3].  The  inference  speed  of  the  system  will  be  more  than  several  tens 
of  M FLIPS  (Mega  Fuzzy  Logical  Inference  Per  Second)  using  high-speed  operation  of  the  beam 
scanning  laser  diodes.  However,  a  real  system  control  using  the  BSLDs  has  not  been  yet  realized 
because  of  the  present  low  resolution  of  their  far-field  pattern. 

In  this  paper,  configurations  of  optoelectronic  antecedent  and  consequent  fuzzy  operation  units 
using  an  LED  array  and  a  PSD  are  proposed.  Furthermore,  the  first  optoelectronic  control  system 
using  the  consequent  unit  is  realized  and  its  usefulness  is  demonstrated  using  an  inverted  pendulum. 

2.  Optoelectronic  analog  antecedent  and  consequent  units 

Figure  1  shows  a  schematic  configuration  of  a  fuzzy  antecedent  grade  evaluation  unit,  which 
calculates  the  satisfaction  of  a  fuzzy  antecedent  rule.  In  this  figure,  1,  2  and  3  serially-wired  LEDs 
are  connected  with  the  input  in  parallel.  1,  2  and  3LEDs  start  emission  in  turn  with  increasing  the 
applied  voltage.  Because  a  PSD  can  detect  a  center -of-gravity  of  the  radiation  pattern,  Gaussian- 
like  membership  function  is  realized  by  position  and  emitting  characteristics  of  the  LEDs.  Figure 
2  shows  a  schematic  result  of  several  membership  functions.  Peak  position  of  the  function  is 
controlled  by  value  of  the  resistors  and  level-shift-diodes. 

Figure  3  shows  a  schematic  configuration  of  an  optoelectronic  fuzzy  consequent  operational 
unit.  Unified  output  membership  function  by  superposition  of  LED  radiations  and  defuzzification 
of  the  radiation  patterns  are  realized  by  an  LED  array  and  a  position  sensing  device.  Figure  4 
shows  an  example  of  defuzzification  of  the  unit.  Unequal  interval  of  each  PSD  output  reflects  the 
separation  of  each  LED  of  the  array. 

3.  Inverted  pendulum  control  system 

Figure  5  shows  a  schematic  configuration  of  optoelectronic  fuzzy  inverted  pendulum  control 
system.  A  50cm-long  free-rotating  inverted  pendulum  is  on  a  stepping-motor-driven  cart  Input 
data  for  the  fuzzy  control  are  angle  of  the  pendulum,  which  is  measured  by  a  rotary  encoder,  and 
angle  velocity,  which  is  calculated  by  the  difference  of  measured  angles.  Consequent  fuzzy 
inference  processes  are  realized  by  the  optoelectronic  operation  unit  and  antecedent  fuzzy  inference 
processes  are  simulated  by  a  personal  computer.  Figure  6  shows  schematic  experimental  results  of 
various  experimental  conditions.  Angle  of  a  controlled  inverted  pendulum  after  initial  perturbation 
(40cm/sec,  OJsec)  is  displayed  as  a  function  of  time.  Cases  of  optoelectronic  control  (0.015 
sec/cycle)  and  pure  electronic  fuzzy  control  (0.08  sec/cycle)  are  shown  in  the  figure.  Difference 
of  foe  cycle  time  are  from  introduction  of  optoelectronic  processing.  Control  gain  of  HIGH  in  foe 
figure  is  4.5  times  than  that  of  LOW.  The  results  show  foe  superior  controllability  of  foe 
optoelectronic  system. 

4. Coachisk>n 

First  real-time  control  of  an  inverted  pendulum  is  realized  based  on  an  optoelectronic  analog 
fuzzy  inference  System.  Both  antecedent  and  consequent  parts  of  foe  fuzzy  inference  is  realized 


WP34/298 


by  LED  arrays  and  position-sensing -device  arrays. 

The  authors  thank  J.Yokoyama,  M.Mori,  M.Watanabe,  T.Hidaka  and  H.Yajima  for  technical 
support  and  fruitful  discussions. 
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Abstract 

Hybrid  analogue/digital  optoelectronic  devices  allow  implementation  of  practical  optical 
neural  networks.  A  smart  pixel  design  to  perform  a  unique  learning  algorithm  is  presented. 
Network  simulations  reveal  the  advantages  of  high  contrast  receivers/transmitters/neurons. 

Motivation  for  Implementation  of  Optical  Neural  Network. 

A  major  strength  of  optical  computing  architectures  is  the  high  degree  of  parallelism  which 
can,  theoretically,  be  exploited.  However  the  actual  implementation  of  such  highly  parallel  sys¬ 
tems  can  be  problematic  with  non-uniformity  of  the  processing  elements  being  a  major  drawback. 
A  fault-tolerant  neural  processor  may  be  used  to  overcome  these  non-uniformities  at  the  cost  of 
more  individual  thresholding  elements  and  greater  in-unit  complexity.  However  the  large  fan- 
in/fan-out  requirements  of  such  networks  prevent  the  full  realisation  of  the  potential  of  the  neural 
paradigm  in  conventional  silicon  electronics.  Indeed  the  extension  of  neural  (  or  hybrid  ana¬ 
logue/digital  )  processing  to  more  complex  tasks  requires  a  novel  approach  to  algorithm  design 
that  exploits  the  dynamics  and  physical  characteristics  of  the  technology  of  the  implementa¬ 
tion.  Optics  is  a  natural  medium  in  which  to  conceive  neural  processor  design  due  to  the  high 
communication  bandwidth  available.  A  topology  for  such  a  hybrid  implementation  of  a  simple 
perceptron-like  neural  network  [1]  is  presented  in  Figure  1. 

Implementation,  of  an  Optical  Neural  Network. 

The  devices  used  in  the  neural  plane  of  this  network  are  S-SEEDs  although  a  similar  network  has 
been  simulated  using  the  NLIF  class  of  device.  [2]  The  behaviour  of  the  S-SEED  is  well  modelled 
[3]  and  is  potentially  suited  for  use  as  a  thresholding  neuron.  The  dual-rail  nature  of  the  device 
permits  dynamic  alteration  of  the  threshold  level  of  each  neuron.  The  learning  algorithm  used 
by  the  network  is  designed  to  exploit  the  dual-rail  and  dynamic  thresholding  behaviour  of  the 
S-SEED  neurons. 

The  network  presented  here  has  been  simulated  on  a  digital  array  processor  for  a  32  unit 
network  and  a  number  of  simple  problems  tackled.  These  include  pattern  recognition,  compos¬ 
ite  Boolean  functions  and  simple  arithmetic.  The  number  of  presentations  required  to  learn  a 
particular  function  is  shown  to  be  dependent  upon  the  optical  characteristics  and  dynamics  of 
the  devices  employed.  For  example  for  optimal  operation  a  higher  contrast  ratio  than  currently 
acheiv&bk  with  S-SEEDs  is  desireable,  therefore  the  basic  S-SEED  may  not  be  the  ideal  device 
to  deploy  in  the  neural  plane.  Two  avenues  currently  being  explored  to  produce  higher  contrast 
ratio  neuronal  devices  are  AFPMs  [4]  and  smart  pixel  neurons  [5]. 

Smart  Pixel  Design  for  the  S-SEED  Optical  Neural  Network. 


The  integration  of  optical  modulators  with  electronics  allows  units  to  be  designed  which  can 


WP35/300 


calculate  the  changes  in  the  weight  matrices  produced  by  the  optical  outputs  from  the  neural 
plane.  The  hybrid  technology  can  also  be  used  to  perform  the  multiplication  of  each  optical  input 
by  the  appropriate  weights.  By  utilising  an  assymmetrically  clamped  S-SGED  the  voltage  input 
into  the  circuit  can  be  linearly  dependent  upon  the  optical  power  incident  upon  the  one  of  the 
windows  of  the  S-SEED.  This  allows  the  weight  update  circuitry  to  be  simple  and  to  perform  its 
task  with  the  minimum  of  propagation  delay. 

In  general  the  simulation  of  the  complete  network  allows  us  both  to  optimise  the  algorithms 
and  to  evaluate  the  effectiveness  of  particular  device/architecture  combinations. 
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Abstract 


An  optoelectronic  implementation  of  a  multi-layer  perceptron  with  on-line  learn¬ 
ing  is  presented.  Results  obtained  from  two  demonstrator  systems  are  reported. 


Summary 

An  optoelectronic  implementation  of  a  multi-layer  perceptron  with  on-line  learn¬ 
ing  is  presented.  Results  obtained  from  a  transmissive  design  demonstrator  and  a 
simpler,  more  compact,  reflective  design  demonstrator  will  be  reported.  The  basic 
design  philosophy,  which  is  the  same  for  both  systems,  is  presented. 

Both  demonstrators  exploit  an  optical  matrix-matrix  multiplier  with  a  spatial 
light  modulator  shutter  array  and  computer  generated  holograms  and  refractive 
micro-lens  arrays  providing  the  required  fan-out  and  fan-in  functions.  The  latest 
system  consists  of  a  reflective  architecture  whereby  different  areas  of  the  same  spatial 
light  modulator  (SLM)  are  used  simultaneously  for  the  input  to  the  system  and  for 
weighting  the  interconnect,  thus  facilitating  a  significant  reduction  in  the  size  and 
hardware  cost  of  the  system  (compared  to  the  two  SLM  transmission  architecture  ). 

The  implementation  technology  places  constraints  on  the  network  model  not 
found  in  software  simulations  such  as  the  need  to  accomodate  hard  thresholds  and 
quantised  weight  values.  Unfortunately,  the  most  sucessful  learning  algorithms  in¬ 
volve  the  evaluation  of  a  gradient  which  is  not  possible  to  calculate  in  this  case. 
We  report  on  how  successful  on-line  learning  may  still  be  achieved  using  either  a 
heuristic  self-optimisation  algorithm  or  a  rigorous  stochastic  approximation  algo¬ 
rithm.  For  the  classification  of  nine  64  pixel  binary  patterns  containing  up  to  16% 
pixel  errors  using  a  system  with  16  quantised  weight  levels,  classification  accuracy 
was  increased  from  93%  to  97.6%  with  the  application  of  self  optimisation. 

Results  of  a  demonstration  of  the  system  configured  as  a  back  propagation  neu¬ 
ral  network  is  presented  and  comparisons  are  made  between  results  obtained  from 
transmissive  and  reflective  architectures. 
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Abstract  :  Both  signal  feed-forward  and  error  back-propagation  paths  for  adaptive  learning  of  TAG 
neural  network  model  are  implemented  by  a  single  ground  glass,  which  provides  much  higher  diffraction 
efficiency  and  interconnection  density. 

Introduction  :  Our  goal  has  been  to  find  solid  compact  optical  architectures  for  large-scale  implemen¬ 
tation  of  artificial  neural  networks  with  adaptive  learning  capability.  We  had  proposed  a  new  adaptive 
neural  network  architecture,  TAG(Training  by  Adaptive  Gain)[l),  which  consists  of  the  random  fixed  global 
interconnections  and  local  adaptive  gain- controls.  The  TAG  neural  network  model  is  suitable  for  large- 
scale  optical  implementation  owing  to  much  less  adaptive  elements.  The  fixed  global  interconnections  may 
be  implemented  by  multifacet  hologram[l,2],  volume  hologram[3],  or  ground  glass[4],  while  the  adaptive  1 

gains  are  done  by  spatial  light  modulators(SLMs).  The  fixed  global  interconnections  are  predetermined 
randomly  or  obtained  from  any  learning  algorithm  for  standard  patterns.  The  ground  glass  is  more  ad¬ 
vantageous  for  random  interconnections  with  much  higher  diffraction  and  interconnection  density.  In  this 
paper,  the  optical  signal  feed-forward  architecture  with  ground  glass[4]  is  extended  to  incorporate  error 
back-propagation.  Both  signal  feed-forward  and  error  back-propagation  paths  are  implemented  by  a  single 
ground  glass,  and  adaptive  learning  has  been  demonstrated  for  hetro-associative  memory.  I 

TAG  neural  network  architecture  :  Number  of  adaptive  elements  for  synapses  is  a  major  limiting 
factor  for  practical  large-scale  implementation  of  neural  networks.  As  shown  Fig.  1,  TAG  model  consists  of 
fixed  global  interconnections  and  local  adaptive  gain-controls,  which  results  in  much  less  adaptive  elements 
and  becomes  advantageous  for  image  classification  with  large  number  of  pixels.  In  mathematical  notations 
output  jfij  is  represented  as  ( 

Vij  =  S(yo )  >  Vij  =  (1) 

4,1 

where  xh  and  j/ij  denote  activations  of  kith  input  neuron  and  t'jth  output  neuron,  respectively.  S(.)  is  a 
Sigmoid  function.  To  train  the  TAG  model,  we  have  adopted  gradient-based  least-square-error  minimiza¬ 
tion  algorithm.  And  the  total  error  E  is  defined  as  E  =  Ylijivij  ~  *y)V 2,  where  s  is  an  index  for  * 

each  class  (input-output  pairs),  y  is  the  actual  state  of  an  output  neuron,  and  t  is  its  desired  state.  The 
learning  rule  is  based  on  the  error  back-propagation.  By  applying  chain  rule,  the  partial  derivatives  of  E 
with  repect  to  the  local  gain-controls  and  w*i  are  obtained  as 

If  -  £%«//»« .  =  E *.*;./»>'  ■  <2> 

where  yj;  is  argument  of  the  Sigmoid  function  in  Eq.  (1)  with  input  *J,.  The  6‘j  and  -yj,  are  output  and 
input  errors,  respectively,  and  defined  as 

%  =  (Vti  ~  WW) .  (3) 

•J  < 

The  single-layer  TAG  may  be  cascaded  for  multi-layer  architecture,  and  still  error  back-propagation  is 
applicable  for  training  of  the  local  gain-controls. 

Optical  implementation  using  ground  glass  :  For  the  fixed  global  interconnections  multifacet  holo- 
gram(MFH)  may  be  used.  However  for  very  large  scale  implementations  the  MFH  shows  limited  intercon¬ 
nection  density  and  low  diffraction  efficiency.  We  had  developed  the  optical  implementation  of  TAG  model  1 

for  only  the  forward  signal  path  using  cheap  ground  glass[4],  while  error  back-propagation  was  performed 
by  a  personal  computer(PC).  But,  learning  in  a  conventional  computer  is  often  a  very  time-consuming 
because  of  (transpose  matrix)- vector  multiplication  for  the  error  back-propagtion. 

In  Fig.2,  a  schematic  illustration  of  optical  implementation  for  single-layer  TAG  model  with  both  sig¬ 
nal  feed-forward  and  error  back-propagation  paths  using  a  single  ground  glass  is  shown.  As  described  in 


i 
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Ref.  [5],  a  single  SLM  with  multifacet  hologram  architecture  may  be  used  for  both  forward  and  back¬ 
ward  ( transpose)  path  with  N*  weighted  interconnections,  and  now  replaced  by  a  cheap  ground  glass  for 
random  interconnections.  For  forward  signal  path,  the  local  gain-control  wki  is  combined  with  input  xj,. , 
and  displayed  at  CRT  with  gray  levels.  Uniform  laser  beam  incident  at  the  mirror  side  of  the  LCLV 
generates  a  2-D  optical  patterns  corresponding  to  wtixti’a,  which  is  multiplied  with  random  fixed  glob¬ 
al  interconnections  Tiki's  by  the  ground  glass.  The  diffracted  lights  at  the  LCD  panel  plane  become 
iHj/vij  =  i  Tijtiwuxu.  When  all  pixels  of  LCD  panel  are  set  dear,  this  diffracted  lights,  reflected  by 
the  beamsplitter(BS)  and  passing  through  relay  lenses  LI  and  L2,  can  be  detected  by  Detectorl(CCD- 
Camera).  The  output  gain-controls  tty’ s,  the  sigmoid  operation,  and  calculations  of  errors  s  at  output 
neurons  may  be  performed  by  a  personal  computer.  In  case  of  error  back-propagation,  all  pixels  of  LCLV 
are  set  clear,  and  the  combined  error  ttyJy  is  displayed  at  the  LCD  panel.  The  lights  collected  by  the  De¬ 
tector  are  products  of  the  form  SijVijTijki  by  aid  of  Lens  L3  with  SLMs.  The  adaptive  local-gain  changes 
for  tt/ti  and  tty  are  implemented  by  a  personal  computer.  The  personal  computer  in  this  experiment  may 
be  easily  substituted  by  2-dimensional  arrays  of  smart  pixels  consisting  of  photodiodes  and  appropriate 
electronic  circuits.  The  simplicity  of  this  architecture  with  cheap  ground  glass  is  a  big  advantage  over 
existing  architectures  using  volume  holograms,  lenslet  array  with  SLMs,  or  multifacet  holograms. 

In  the  experiment,  to  simulate  bipolar  synapses,  difference  between  two  neuron  values  are  used.  And 
bipolar  input  patterns  are  handled  by  a  time-multiplexing  technique.  This  optical  system  is  currently 
trained  to  classify  six  5  x  7  number  patterns. 

Conclusion  :  We  have  demonstrated  the  feasibility  of  optical  implementation  for  the  signal  feed-forward 
and  the  error  back-propagation  by  a  single  ground  glass.  Provided  2-dimensional  arrays  of  photodiodes  and 
electronic  circuits  were  available,  this  TAG  with  ground  glass  may  come  up  with  compact  solid  electro-optic 
artifical  neural  networks. 
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Fig.  1  Single-layer  TAG  architecture  Fig.  2  Schematic  illustration  for  the  experiment  of  optical 

TAG  neural  network  with  signal  feed-forward  and 
error  back-propagation  path. 
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Abstract: 

A  new  architecture  combined  with  an  optical  Fourier  transform  processor 
and  a  three-layer  optical  neural  network  is  proposed.  This  hybrid  system  can 
reduce  a  number  of  neural  elements. 

Optical  neural  network  computing  is  of  interest  in  terms  of  massively  parallel  computing. 
Many  optical  neural  network  systems  based  on  the  matrix-vector  multiplication  architecture,111 
however,  are  limited  in  a  scale  of  parallelism,  because  of  spatial  resolution  of  optical  systems 
and  spatial  light  modulators.  Besides,  when  the  bipolar  synaptic  weights  are  represented  by 
optical  intensity,  they  are  divided  into  positive  unipolar  weights  and  negative  unipolar  weights'21131. 
This  means  that  twice  number  of  the  synaptic  weights  are  needed.  To  solve  these  problems,  as 
an  alternative  method,  we  introduce  optical  feature  extraction  techniques.  The  feature  extraction 
can  map  the  data  onto  a  new  feature  space.  The  data  on  the  feature  space  are  subjected  to  input 
data  for  an  optical  neural  system.  If  we  can  design  a  suitable  mapping  system,  the  ability ,  the 
function,  and  the  size  of  the  optical  neural  system  could  be  reduced. 

We  propose  here  a  new  type  of  neural  computing  with  a  preprocessor  for  f».ature 
extraction  to  reduce  the  number  of  neural  units.  A  variety  of  optical  feature  extraction  techniques 
have  been  discussed,  which  include  analog m  and  digital  methods151.  The  Fourier  transform,  the 
Hough  transform,  the  polar  coordinate  transform,  the  scale-invariant  transform  and  so  on  are 
typical  examples  of  analog  methods. 
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Fig.  1  Concept  of  a  hybrid  optical  neural  computing. 
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Figure  1  shows  a  concept  of  a  hybrid  optical  neural  network  using  the  Fourier  transform 
which  is  one  of  feature  extraction  preprocessing.  We  note  that  a  Fourier  transform  of  a  real 
object  is  shift  invariant  and  point-symmetrical.  A  half  region  of  the  Fourier  transform  is  connected 
with  positive  synaptic  weights,  the  other  region  negative  synaptic  weights.  This  procedure 
reduces  half  the  number  of  weights  to  display  on  a  spatial  modulator. 

Computer  simulation  for  the  feature  extraction  preprocessing  and  the  neural  network 
with  the  back-propagation  learning  algorithm  is  performed.  We  consider  a  three-layers  neural 
network  consisting  of  64x64  input  layer  neurons,  64  hidden  layer  neurons,  and  5  output 
neurons,  to  recognize  26  alphabet  characters  of  64x64  pixels.  At  first  we  calculate  Fourier 
transforms  of  input  images.  Because  of  point  symmetry  of  Fourier  transform  images,  the  half 
of  transformed  images  are  used.  Figure  2  shows  the  change  of  the  squared  sum  error  of  26 
characters  vs  iteration  times  of  learning  iteration  in  the  cases  when  inputs  for  the  neural 
network  are; 

(1)  original  character  images  without  Fourier  transforming, 

(2)  Fourier  transformed  images 

(3)  half  of  Fourier  transformed  images. 


We  have  estimated  the  recognition 
ability  for  non-shift  and  for  shifted 
input  images  using  the  value  of  the 
weights  and  offsets  when  the  learning 
is  completed.  Figure  3  shows  the 
error  rates  of  recognition  for  each 
input  pattern  with  and  without 
random  shift  within  4  pixels.  The 
results  indicate  reduction  of  the  error 
rates  due  to  shift  invariance  of  the 
Fourier  transform.  The  feature 
extraction  preprocessing  has  two 
advantages;  reduction  of  the  number 
of  neuron  units  and  shift-invariance 
for  input  patterns. 


Iterations 


Fig.  2  Change  of  the  squared-sum  error 
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Abstract 

We  present  an  optical  implementation  of  the  recognition  stage  of  a  Kohonen-like 
neural  network  based  on  a  supervised  learning  algorithm.  The  set-up  includes  diffractive 
optical  elements  to  both  store  and  address  the  weights. 


1.  Introduction 

Our  work  involves  the  study  of  optical  implementation  of  neural  networks. 
We  describe  the  optical  implementation  of  the  recognition  stage  of  a  neural 
network  based  on  Kohonen's  feature  maps.  Indeed,  this  stage  can  be  understood 
as  a  multichannel  correlation  and  thus  is  well  suited  to  an  optical 
implementation. 

2.  Learning  algorithm 

We  use  a  supervised  learning  algorithm,  based  on  Kohonen  self¬ 
organizing  feature  maps,  for  pattern  recognition.  While  in  the  classical  Kohonen 
algorithm  the  map  is  self  organized  by  the  inputs  according  to  their  probability 
density  function,  pattern  recognition  applications  need  some  supervision  to  map 
together  similar  stimuli  in  terms  of  their  context  and  not  in  terms  of  a  known 
vector  distance  metric.  We  adopt  here  an  idea  previously  used  for  semantic  map 
organization  [1]  adapted  to  pattern  recognition  [2].  The  basic  motivation  is  to 
organize  the  map  by  the  patterns  and  their  association  targets  simultaneously.  A 
by-product  of  this  process  is  that  the  class  labelling  of  neurons  on  the  map 
emerges  during  the  learning  phase,  while  in  the  classical  Kohonen  maps  the 
labelling  is  achieved  after  the  leamig  stage. 

3.  Numerical  studies 

The  algorithm  and  its  application  for  handwritten  digit  recognition  have 
been  numerically  implemented.  More  precisely,  two  points  relevant  to  an  optical 
implementation  were  studied.  For  classical  Kohonen  feature  maps,  the  selection 
of  a  neuron  is  made  by  minimizing  a  distance  between  the  input  and  the  weights. 
For  an  optical  version,  it  is  easier  to  maximize  an  inner  product,  which  is  strictly 
equivalent  to  minimizing  a  distance  under  the  condition  that  the  weights  and  the 
inputs  used  for  the  learning  stage  are  normalized.  Another  point  of  matter  is  to 
study  the  influence  on  the  results  during  the  recognition  stage  when  using  a 
binary  hologram  instead  of  an  actual  weights  map. 


4.  Experimental  work 
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For  our  set  up,  the  size  of  the  inputs  is  16x16  pixels  and  the  Kohonen  map 
has  8x8  neurons  so  that  2S6x64  weights  must  be  coded  using  a  hologram  of 
1024x1024  binary  pixels.  This  set  up  is  a  slightly  improved  version  of  a 
previously-used  one  for  the  implementation  of  an  associative  memory  [3,4].  It  is 
specifically  designed  to  achieve  a  multiplexing  of  connection  holograms 
involving  a  special  coding  scheme  for  the  weights  matrix,  that  we  called 
Frequency  Multiplexed  Raster  (F.M.R.).  It  leads  to  standard  spatial  filtering  set  up 
except  that  the  inputs  must  be  sampled  in  a  special  manner  and  the  holographic 
filter  contains  the  F.M.R.  coded  synaptic  array.  An  holographic  element 
(microlens  array  or  Dammann  gratings  for  instance)  enables  the  achievement  of 
the  sampling  scheme  of  the  inputs. 

This  component  and  the  connection  hologram  consist  of  a  silicon  substrate 
processed  by  traditionnal  means  :  contact-lithography  using  e-beam  designed 
masks  and  reactive  ionic  etching  (R.I.E). 

The  output  image  diffracted  by  the  connection  hologram  is  detected  using 
a  CCD  camera.  This  image  shows  the  activation  of  the  neurons.  For  the  classical 
Kohonen  algorithm,  a  special  kind  of  non-linearity  called  "winner-takes-all"  is 
used,  which  means  that  only  the  most  active  neuron  is  retained.  In  our  case,  the 
difference  between  neurons  activation  is  first  emphasized  by  quadratic 
detection.  The  decision  is  then  made  by  a  micro-computer  and  suitable  software. 

5.  Conclusion 

As  a  conclusion,  we  present  results  for  the  three  steps  of  an  optical 
implementation  :  software  implementation  for  the  learning  algorithm, 
numerical  simulation  of  the  optical  implementation  and  preliminary 
experimental  optical  implementation.  We  are  still  working  on  the  experimental 
part  of  this  work  in  order  to  improve,  in  terms  of  dynamic  range,  the  quality  of 
the  storage  of  data  for  the  connection  hologram. 
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ABSTRACT 

Optical  Neural  Network  formed  by  p  Lacing  the  holography 
correlator  into  the  linear  resonator  is  discussed. 
Variation  of  the  attractor  position  by  means  of  inhibitory 
optical  interconnections  to  achive  new  solution  types  is 
proposed.  The  experimental,  results  are  presented. 


Optical  Neural  Network  CONN)  under  consideration  consists 
of  two  neuron  layers  -  image  I  and  correlation  C,  which  are 
coincide  to  the  image  and  correlation  planes  of  Van  der  Lught 
correlator.  In  these  layers  there  are  the  phase-conjugating 
mirrors,  f orming  the  linear  resonator.  Interconnection  matrix 
which  associating  the  pair  of  the  learning  vectors  -  reference 
image  R  in  layer  I  and  6-function  in  layer  G  is  realized  as 
holography  filter. 

Usially,  this  architecture  is  considered  as  the 
bidirectional  associative  memory  only  11,21.  The  reason  for 
this  is  that  this  NN  has  unique  attractor  in  the  space  of  the 
network  states  and  this  attractor  corresponds  to  the  pair  of 
the  learning  vectors  Ill. 

However,  other  solution  types  were  demonstrated  on  this 
ONN  -  associative  processing  with  common  or  different  parts  of 
the  input  and  reference  Images  restoration  13,41.  These 
experimental  results  contradict  the  conventional  optical 
associative  memory  theory.  In  this  paper  the  approach  based  on 
the  anilysis  of  the  interconnection  massive  and  network  state 
space  is  proposed  to  explain  these  results. 

To  achive  these  new  solution  types  (except  associative 
memory)  the  position  of  the  attractor  in  the  network  state 
space  must  be  changed.  Associated  pair  R  «-»  6  must  be  replaced 
by  the  pair,  including  the  common  part  of  the  x^fTmnc»  and 
object  images  R’  in  the  layer  I.  To  do  this,  the  neurons  in 
layer  I  (I-neurons),  corresponding  to  the  vector  <R-R'),  are 
inactivated  by  the  inhibitory  interconnections  involving.These 
inhibitory  interconnections  were  absent  in  the  original 
matrix,  activating  interconnections  only  were  presented.  And 
the  interconnection  matrix  must  not  be  changed  phisically 
(i.e.  the  hologramm  must  not  be  rewrited)  because  R’  is  not 
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known  a’priori.  Therefor*,  the  matrix  changing  is  adaptive  to 
the  input  vector,  but  no  new  learning  occurs. 

The  second  vector  Cin  layer  C>  of  the  new  pair,  which  is 
associating  by  holography  filter,  is  <R«R')  and  the  inhibitory 
interconnections  from  C- neurons  corresponding  to  the  CR«R’) 
si  delobe  are  used.  But  the  interconnections  using  themselves 
is  impossible  due  to  realization  them  by  hologramm  and  this 
hologramm  must  not  be  rewriting.  That  is  why  the  problem  is 
solved  in  layer  C.  C- neuron*  activated  by  R’  only  must  be  used 
to  suppress  <R-R’),  but  it  is  Impossible  because  part  of 
C- neurons  are  activated  by  both  R’  and  B,  where  B  is  the  part 
of  input  vector,  that  B  n  R  «  0.  The  impossibility  of  the 

separation  of  the  interconnections  CUR*  from  CUB  results  in 
activating  of  the  I~  neurons  corresponding  to  B  ,  but  this 
result  is  neither  B  nor  R’,  but  the  noise  is.  This  noise 
depends  on  the  alignment  and  allocation  of  R’  and  B  in  I.  The 
number  of  activated  C- neurons  is  optimused  to  decrease  this 
noise  in  the  output  vector  by  proposed  for  this  aim  power 
criterion. 

Unlike  the  activating  interconnections  5»R,  the 
inhibitory  interconnections  are  not  invariant  to  the  angular 
displacement  of  the  restorating  beam  in  G-  layer.  This 
determines  conditions  on  the  ONN  devices  accuracy.  The  number 
of  inhibitory  interconnections  depends  on  the  number  of  the 
C- neurons  activated  by  (R’«R)  side  lobe  and  it  determines 
conditions  on  the  information  capability  of  the  processing 
images  . 

« 

The  experimental  results  of  the  restoration  of  the 
identical  part  of  images  for  various  numbers  of  activated 
C- neurons  are  presented. 

Therefore,  the  potential  for  ONN  under  discussion  is 
wider  then  only  associative  memory.  The  algorithms  of  the 
restoration  of  the  identical  and  nonidentical  parts  of  the 
images  are  possible.  These  algorithms  are  realised  by 
involving  the  inhibitory  interconnections  to  change  the 
attractor  position. 


REFERENCES 

1.  B.Kosko,  “Adaptive  Bidirectional  Associative  Memories*’, 
Apple  .Opt.,  1987,  v.26,  *23,  p.4947-4960 

2.  Y.Owechko,  “Nonlinear  Holographic  Associative  Memories", 
IEEE  Journal  of  Quant.  Electr.,  1989,  v.25,  *3,  p.619-634. 

3.  A. V. Pavlov,  E.I.Shubnikov,  “Revealing  of  Different  and 
Common  Features  in  Images  on  Holographic  Correlator",  Opt.  8 
Spectr.,  1987,  v.63,  *3,  p  .589-593. 

4.  A.V.Pavlov,  "Possible  Image  Processing  Algorithms  in 
Optical  Neural  Networks  Based  on  the  Van  der  Lught 
Correlator",  Opt.  Mem.  ft  Neural  Networks,  1992,  v.l,  *2, 
p.153-156. 


WP41/311 


OPTOELECTRONIC  SORTING  NEUROCHIP 
V.R.Grigor’ev  and  S.P.Naumov 

Scientific  Research  Institute  “Quant",  15,  4-th  Ukhachevsky  Lane, 

125438,  Mescaw,  Russia. 

Abstract  A  new  sorting  algorithm  on  the  adaptive  neural 
network  is  offered.  It  is  proved  that  algorithm  can  be 
implemented  in  one  step  irrespectively  of  the  sorted 
succession  The  features  of  3D  optical  neuralchip 
construction,  realizing  adaptive  sorting  algorithm,  are 
discussed.  A  neural  hardware  realization  complexity 
estimations  are  given. 

Summary.  Parallel  sorting  algorithm  elaboration  is  a  classical  task  which 
practical  importance  is  the  creation  of  parallel  programs  and  devices  capable 
of  providing  effective  sorting  of  big  data  massives  on  modem  and  respective 
multiprocessing  computing  systems.  In  this  connection  this  operation  on  the 
algorithms  and  hardware  level  is  very  actual. 

In  this  paper  a  new  sorting  algorithm  on  the  adaptive  neural  network  is 
offered.  The  time  and  value  complexity  is  evaluated  0(1)  and  0(n2) 
accordingly.  It  is  proved  that  die  algorithm  can  be  implemented  in  one  step 
irrespectively  of  the  sorted  succession.  As  far  as  the  time  complexity  is 
concerned  the  aquired  results  proved  the  best  in  comparison  with  the 
previously  known  results  of  the  same  volume  complexity. 

The  pecularities  of  die  three  dimensional  optoelectronic  neuralchip 
having  the  hardware  implementation  of  the  given  neural  network  algirithms 
have  been  studied.  The  practical  realization  is  oriented  to  the  three 
dimensional  integral  optic  neuralchip  advanced  manufacturing  technology, 
which  was  worked  out  at  the  Automatic  and  Electrometry  Institute  [  The 
Sibirian  Branch  of  the  Russian  Academy  of  Sience]  [1].  The  research  which 
has  already  been  made,  demonstrates  the  way  of  creating  of  multilayer 
multichannel  integral  structures  with  the  optical  communication  channels  in 
the  third  dimension  according  to  the  compatible  with  the  VLSI  technology 
inserted. 
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It  is  shown  that  realization  of  the  parallel  sorting  algorithm  is  possible 
on  the  single  chip  of  the  given  type  depending  on  the  size  of  the  photodetector 
array  (256x256  or  512x512)  256  or  512  numbers  accordingly.  The  sorting 

neurochip  throughtput  will  be  here  about  408  -  409  op/sec. 

Thus,  the  creation  of  neuralchip  on  the  three  dimensional  integral 
optical  chip  basis  will  make  it  possible  to  reduce  grately  the  time  spent  on 
sorting  in  various  applied  computer  and  information  systems.  The 
comparative  evaluation  of  the  neural  chip  parameters  with  the  submitted 
algorithm  transputer  realization  characteristics  is  applied. 
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Abstract 

Some  "self  correcting"  learning  laws  are  compared  with  photorefractive  dynamics.  Possible 
optical  implementations  of  neural  basic  units  with  photorefractive  crystals  are  proposed  and  the 
key  characteristics,  required  for  self  organizing  network  realization,  are  experimentally 
demonstrated 


Summary 

Modification  laws  of  neural  interconnects  govern  their  processing  capabilities.  The  learning 
process  is  called  "supervised"  when,  for  each  input  signal,  a  classification  code  is  known  a 
priori  and  the  learning  process  is  an  iterative  updating  of  weight  vector  components  to 
approximate  this  code.  So,  at  each  updating  step,  we  deal  with  an  error  correction  controlled  by 
an  external  "teacher"  (the  a  priori  known  code).  Some  learning  algorithms,  however,  allow  to 
get  feature  extraction  in  a  stochastic  set  of  input  signals  and  selectively  self-adapt  to  provide 
neuron  maximal  activation  for  signals  exhibiting  the  same  features.  The  learning  is  conducted 
by  comparing  the  input  signal  with  the  weight  vector  in  which  are  stored  the  common  features 
extracted  by  the  algorithm  during  the  learning  of  previously  processed  signals.  This  mismatch 
is  considered  as  an  effective  error.  The  learning  process  of  the  current  input  then  results  in  a 
correction  of  this  error  by  updating  the  weight  vector  components  proportionally  to  that 
mismatch.  So,  the  algorithm  itself  finds  the  classification  code  (i.e.  the  weight  vector 
coefficients)  from  the  real  time  inputs. 

Digital  simulations  are  mostly  used  to  get  such  kind  of  algorithms  and  only  the  simplest 
versions  of  algorithms  are  implemented  (by  means  of  discrete  time  formalism)  because  of  the 
complexity  of  the  others. 

However,  the  fundamental  dynamics  of  hologram  creation  in  some  nonlinear  optical  materials 
appears  to  be  perfectly  fitted  to  implement  these  algorithms.  In  this  work  we  show,  theoretically 
and  experimentally,  that  different  learning  algorithms  can  be  implemented  by  taking  advantage 
of  the  kinetics  of  photorefractive  holograms.  In  the  first  part  of  the  work  some  of  the  most 
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important  neural  learning  algorithms,  widely  used  in  Self- Organizing  Neural  Networks,  will  be 
described  and  demonstrated  by  digital  simulations.  The  theoretical  model  analysis  will  be 
presented  in  the  second  part  An  holographic  analogy  for  a  neural  basic  unit  will  be  described. 
In  this  scheme,  the  neural  interconnects  are  represented  by  the  elementary  holographic  gratings. 
The  weights  of  interconnects  are  the  diffraction  efficiencies  of  these  gratings  and  the  input 
signal  components  are  angularly  multiplexed  beams.  So  the  result  of  holographic  readout 
(diffracted  light  amplitude)  represents  the  neural  output 

The  dynamics  of  photorefractive  hologram  creation  will  then  be  presented  and  the  analogy  with 
the  above  mentioned  learning  laws  will  be  outlined.  Some  possible  ways  of  optoelectronic 
implementation  of  error  self-correcting  learning  laws,  based  on  the  photorefractive  dynamics, 
will  be  discussed.  The  third  part  of  the  work  describes  the  experimental  realization  of  a  single 
neural  basic  unit  based  on  these  learning  laws  and  the  comparisons  with  digital  simulations. 
The  discrete  time  formalism  is  used  in  our  experiments  similarly  to  computer  simulations.The 
mismatch  of  each  input  signal  with  recorded  earlier  information  is  detected  as  readout  amplitude 
value  and  the  n<r  iral  weights  are  then  updated  with  an  holographic  exposure,  corresponding  to 
the  measured  error.  An  electronic  circuit  through  a  personal  computer  is  used  to  generate 
stochastic  input  signals,  to  perform  the  desired  neuron  transfer  function  and  the  feedback.  The 
interconnect  updating  is  fully  based  on  the  photorefractive  dynamics.  Key  capabilities,  such  as 
competitive  learning,  selective  adaptation  and  feature  extraction  in  a  stationary  stochastic  set  of 
input  signals  are  demonstrated.  These  characteristics  are  in  good  agreement  with  the  theoretical 
models. 

A  proposal  for  a  massive  architecture  of  an  Optoelectronic  Self-Organizing  "Kohonen  Map" 
Network,  based  on  these  neural  basic  units,  and  corresponding  estimations  of  its  processing 
capabilities  will  be  presented. 
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Abstract 

The  method  of  the  replacement  of  the  neural  network  by  the  other,  equivalent  network  with 
nonnegative  or  uniform  weights  is  described.  This  simplification  yields  much  easier 
implementation  of  a  network. 


Summary 

i 

This  paper  is  concerned  on  problem  of  simulation  of  one  neural  network  by  the  other  one. 
Simulating  network  may  have  different  structure  (i.e.  different  number  of  neurons  and 
different  connection  weights)  from  the  initial  one,  but  it  proceeds  in  the  same  way  as  a  initial 
network. 

Neural  network  is  formally  defined  as  the  triple  (V,E,P),  where  V  is  the  set  of  network 
nodes  (neurons);  E:  V  x  V  ->  R  is  connection  weight  function  (i.e.  E(xvy)  is  connection 
weight  from  neuron  x  to  neuron  y);  and  P:  V  R  is  threshold  function  (i.e.  P(x)  is  a 
threshold  of  node  x). 

Simulation  is  defined  by  indication  of  correspondence  of  nodes  in  both  networks  and 
correspondence  of  instant  in  both  networks  proceedings.  This  means,  that  functions  g  i  h  are 
defined.  Function  g  establishes  correspondence  of  nodes  and  function  h  establishes 
correspondence  of  instants.  More  intuitively,  we  can  say:  the  network  S  is  simulated  by 
network  S'  if  the  condition  neuron  x  is  active  in  network  S  in  the  instant  k  if,  and  only  if 
neuron  g(x)  is  active  in  network  S'  in  the  instant  h(k)  is  satisfied  for  all  neurons  x  and  any 
instant  k. 
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Definition  of  network,  simulation  and  related  notions  are  precisely  formulated  in  the  full 
version  of  this  paper.  They  can  also  be  generalized  to  continuous  neuron  model. 

Two  theorems  are  proved.  The  first  one  shows,  that  each  neural  network  can  be  simulated 
(replaced)  by  a  neural  network  with  non-negative  connection  weights  only.  Proof  of  this 
theorem  gives  algorithm  to  construct  simulating  network.  Idea  is  based  on  the  replacement  of 
each  node  of  original  network  by  a  pair  of  two  nodes  in  the  constructed  network.  Connection 
weights  and  the  node  thresholds  are  also  appropriately  defined.  Constructed  network  has  only 
two  times  more  nodes  and  non-zero  connection  weights  that  the  original  network. 

The  second  theorem  shows,  that  each  neural  network  can  be  replaced  by  a  neural  network 
with  the  connection  weights  equal  to  zeros  and  ones.  Proof  of  this  theorem  also  gives 
algorithm  for  the  construction  of  simulating  networks.  Detailed  proof  of  both  theorem  will  be 
described  in  the  full  version  of  this  paper. 
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Experimental  investigation  of  the  performance  of  the  two-layer  neural  network  based 

on  optical  vector-matrix  multiplier. 
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Vadim  V.  Perepelitsa,  Igor  B.  Scherbakov 
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ABSTRACT 

The  paper  presents  the  obtained  results  of  learning  of  the  two-layer  (64x8)  neural 
network  (TLNN),  the  results  of  tolerant  noising  of  weight  matrixes  and  results  of 
hardware  implementation.  The  unperfectness  of  optics  satisfies  the  margin  requirements 
of  TLNN  model. 


SUMMARY 

Different  types  of  neural  networks  (NN)  are  under  careful  investigation  now. 
Creating  of  the  working  models  of  NN,  including  optical  models,  is  most  useful  and 
promising.  One  of  the  most  interesting  is  the  two-layer  NN  (TLNN),  which  can  be  easily 
made  on  the  basis  of  proposed  optical  matrix-vector  multiplier  (OVMM).  OMVM  has 
extremely  high  information  processing  capabilities.  Computing  in  such  TLNN  is 
performed  sequentially  in  OVMM,  high  performance  speed  is  available  due  to  the 
implementation  of  the  acoustooptic  spatial  light  modulator  (Bragg  cell).  Large 
time-bandwidth  product  acoustooptic  devices  are  the  fastest  up-to-date  spatial  light 
modulators. 

As  a  base  devices  for  OVMM  we  propose  to  implement  high-speed  high  dynamic 
range  spatial  light  modulator  namely  the  multichannel  multifrequency  acoustooptic  Bragg 
cell  (MAOM),  laser  array  and  CCD  (charge-coupled  device)  array  in  well-known 
orientation  of  OVMM.  Traditionally  the  DMAC  (digital  multiplication  via  analog 
convolution)  algorithm  is  used  for  obtaining  high  accuracy  in  optical  calculations.  The 
algorithm  we  propose  (fig.l)  is  similar  to  DMAC,  and  the  only  difference  is  concerned 
with  ability  to  use  frequency  coding  in  MAOS.  Due  to  this  fact  we  can  obtain  the  whole 
vector-matrix  multiplication  in  one  step  with  speed  about  frame  rate  of  MAOS.  In  this 
case  the  vector  is  inputed  into  the  laser  array  in  bit-serial  order  -  time-sequentially  bit  by 
bit  of  vector  components  in  corresponded  to  this  component  laser.  The  matrix  is  entered 
into  MAOS  -  each  element  in  each  window,  but  each  element  is  represented  in  binary 
code.  Bits  of  each  e'ement  is  frequency  multiplexed.  The  CCD  operates  in  shift-and-add 
mode.  Shift  velocity  corresponds  to  bit  loading  velocity  of  lasers.  At  the  end  of  lasers 
expositions  the  CCD  will  contain  sequence  of  data  (output  vector  components)  in 
presentation,  similar  to  DMAC.  The  supporting  electronic  (personal  or  specific  computer) 
is  responsible  for  converting  the  presentation  of  this  data  from  mixed  to  hex  format,  the 
thresholding,  the  synchronization  etc. 

The  TLNN  simulation  software  program  was  designed.  Simulated  NN  contains  64 
neurons  in  the  first  layer  and  8  neurons  in  the  second.  The  network  was  trained  with 
gradient  and  stochastic  algorithms.  The  ability  of  trained  network  allows  to  recognize 
binary  images  of  all  26  alphabet  letters.  Each  image  consists  of  8x8  pixel. 


Fig.l.  Optical  vector-matrix  multiplier. 


The  trained  network  has  stable  recognition  of  images  with  amount  of  inverted  pixels 
up  to  15-20%.  Gradient  learning  was  performed  in  300  iterations.  The  velocity  of 
stochastic  method  is  less. 

The  tolerant  noise  simulations  was  carried  out.  The  ratio  of  tolerant  noising  in 
weight  matrix  and  input  vector  was  defined.  The  recognition  ability  of  TLNN  decreases 
on  1%  for  10%  noise  ratio  in  the  TLNN  parameters.  This  fact  allows  to  cover  the 
unperfectness  of  the  OVMM  and  became  the  implementation  possible. 

The  paper  presents  mathematical  description  of  TLNN,  learning  methods  overview, 
optical  architecture  and  software  and  hardware  simulation  results. 
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Features  of  application  of  holographic  icaories 
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ABSTRACT 


The  characteristics  of  experimental  prototypes  of  disc 
holographic  memories  1,5  Gbytes  with  data  reading  rate  16 
Mbit/sec  are  given. 

The  features  of  interfacing  these  devices  and  personal 
computers  based  on  data  holographic  recording  are  discussed 
as  well  as  the  question  on  application  of  the  data  recorded 
in  holographic  disc  as  a  comand  matrix  for  neural  computer . 

SUMMARY 

The  efficiency  of  optical  memory  systems  is  determined 
higher  than  the  efficiency  of  the  traditional  methods 
using,  it  is  conditioned  by  recording  density  and 
information  capacity,  by  high  speed  recording  and  reading 
as  well  as  by  parallel  calls.  Holographic  memories  on  one 
-dimensional  holograms  realize  the  following  : 

-possibility  to  increase  by  order  or  more  the  density  of 
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data  recording  due  to  capacity  increase; 

-relative  simplicity  for  realization  of  multichannel 
systems  and,  therefore,  the  possibility  to  realize  high 
speed  recording  and  reading  the  information  from  parallel 
channel Is ; 

-increased  reliability  of  recording  based  on  holographic 
method  where  every  information  bit  "is  spread”  over  the 
entire  area  of  hologram  and,  so  the  level  of  reading  signal 
weakly  depends  on  small  defects  of  the  carrier. 

In  our  days  the  prototypes  of  devices  1,5  Gbytes  with 
information  selection  rate  16  Mbit/sec  are  constructed. 
These  parameters  are  achieved  by  using  the  unique  lines  of 
avalanhce  photodiodes  with  high  sensitivity  and  speed  as  a 
phot.odetector .  Howerer,  these  photodetectors  require  a 
specific  order  of  information  organization  caused  by 
absence  of  constant  component  in  signal.  Therefore,  the 
application  of  holographic  systems  as  a  fixsed  memory  for 
PC  and  PC  networcs  requires  the  construction  of  special 
coders  and  decoders.  The  function  specific  of  coders  is  the 
interpretation  of  income  information  into  balanced  code 
limiting  in  data  flow  the  continuous  sequence  of  nulls  and 
units  (not  more  3  units  or  nulls  in  order). 

The  similar  problem  appears  when  developing  the 
architecture  of  optoelectronic  neuralcomputer  based  on  disc 
holographic  memory  as  a  matrix  of  synaptic  links  recorded 
in  disc  can  contain  any  information.  The  developed  method 
of  information  organisation  and  constructed  coders  and 
decoders  allow  to  realize  the  neural  modulus  for  processing 
the  synaptic  matrix  as  the  information  entering  and  to 
develop  microsoftware  for  performing  adjustment  both  the 
neural  modulus  and  demonstration  task. 
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High-voltage  CMOS  array  for  liquid  crystal  on  silicon  spatial  light  modulators 

C.  C.  Mao  and  K.  M.  Johnson 
The  Center  for  Optoelctronic  Computing  Systems 
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The  design  and  test  results  for  a  64  x  64  high-voltage  CMOS  array  which  can  provide  voltages 
as  high  as  30  V  to  switch  liquid  crystals  are  described. 

L  Introduction 

Vexy-large-scale-integratioivTenoelec  trie-liquid  crystal  (VLSI/FLC)  spatial  light  modulators 
(SLMs)  have  a  great  variety  of  applications,  including  image  display,  optical  processing,  and 
optical  optical  pattern  recognition.  With  the  standard  CMOS  process,  typical  transistor  voltages 
available  to  modulate  the  liquid  crystals  are  approximately  5  V.  These  voltages  result  in  slow 
FLC  response  times,  ranging  between  50  us  to  200  us.*-3  It  is  also  difficult  to  switch  the  analog 
electtoclinic  LCs  with  such  low  voltages.  To  reduce  die  FLC  switching  times  and  activate  the 
analog  switching  chiral  smectic  electroclinic  LCs,  volatges  higher  than  typical  CMOS  voltages 
are  required  at  cadi  pixel.  In  this  paper,  we  describe  a  novel  CMOS  64  x  64  array  which  can  pro¬ 
vide  voltages  as  high  as  30  V  to  switch  liquid  crystals. 

IL  High-voltage  CMOS  array  design 

The  maximum  voltage  a  transistor  can  operate  is  limited  by  the  junction  breakdown  voltage. 
Transistor  junction  breakdown  voltages  are  inversely  proportional  to  die  impurity  concentrations 
of  their  deffusion  regions.  This  implies  that  using  a  lower  doping  concentration  for  die  transistor 
drains  should  extend  their  voltage  handling  capacity.  This  approach  is  applied  to  die  structure 
shown  in  Fig.  1.  The  n-fet  uses  the  n-well  as  its  drain,  which  has  lower  doping  than  n-diffusion, 
and  the  p-fet  uses  p-base  as  its  drain,  which  has  lower  doping  than  p-diffiision.  The  n-well  has  a 
breakdown  voltage  of  approximately  70  V  and  die  p-base  implants  breaks  down  to  die  n-well  at 
a  voltage  arround  35  V.  These  voltages  are  well  above  the  voltages  required  to  switch  2  um  of 
die  chiral  smectic  liquid  crystals.  However,  die  poly  silicon  gates  significantly  decrease  die 
breakdown  voltages.  Our  measurement  results  show  that  the  highest  voltages  the  n-fet  can  ac¬ 
comodate  is  30  V,  while  the  p-fet  voltage  is  lmwtfri  only  to  15  V.  To  make  the  circuit  provide  a 
voltage  higher  than  15  V  to  switch  liquid  crystal,  we  may  drop  the  voltage  across  several  such 
transistors  in  series.  A  high-voltage  addressing  circuit  including  four  high-voltage  p-fets  and 
two  high-voltage  n-fets  is  designed  to  provide  30  V  to  switch  FLCs.  The  voltages  applied  drop 
across  two  p-fets,  so  each  p-fet  operates  with  15  V. 

We  have  designed  and  fabricated  a  64  x  64  high-voltage  VLSI  backplane  array  to  rapidly  switch 
FLCs.  The  array,  including  bonding  pads,  is  4.4mm  x  6.6  mm.  Each  pixel  contains  one  high- 
voltage  n-fet  transistor  and  a  metal  modulating  pad.  The  pixel  is  48  um  in  x  and  y  dimensions. 
The  row  and  column  addressing  circuits  consist  of  common  CMOS  transistors  and  high-voltage 
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addressing  drcpits  as  described  in  tbe  above  paragraph.  The  high-voltage  addressing  circuits 
convert  TTL-level  voltages  to  high  voltages. 

□L  Experimental  results 

A  liquid  crystal  on  silicon  (LOOS)  high-voltage  electrically  addressed  spatial  light  modulator 
(EASLM)  was  assembled  and  tested.  The  liquid  crystal  used  is  the  British  Drug  House  smectic 
C*  ferroelectric  liquid  crystal  SCE-13.  In  the  operation,  metal  modulating  pads  in  the  pixels  are 
provided  with  25  V  or  0  V  and  the  common  electrode  on  the  cover  glass  is  applied  with  12.5  V. 
When  a  uniform  light  beam  is  incident  upon  the  device,  the  reflected  light  from  VLSI/FLC  inter¬ 
face  is  polarization  modulated,  Using  an  analyzer  in  output  optical  path  will  display  an  intensity 
image.  Fig.  2  shows  an  intensity  modulated  pattern  displayed  on  die  high-voltage  EASLM.  We 
also  tested  the  response  times  and  ON-OFF  contrast  ratio  of  the  SLM  with  an  applied  voltage  of 
25  V.  The  10%  -  90%  rise  rime  is  approximately  25  us  and  the  90%  -  10%  fall  time  is  30  us. 
Tbe  meaured  ONAJFF  contrast  ratio  for  four  rows  is  10:1. 

IV.  Conclusions 

We  have  designed,  fabricated,  and  tested  a  64  x  64  high-voltage  VLSI/FLC  spatial  light  modu¬ 
lator.  The  SLM  has  a  10%  -  90%  riae  time  of  25  us  and  a  90%  -  10%  fall  tune  of  30  us.  The 
measured  ON/OFF  contrast  ratio  is  10:1.  We  also  designed  a  VLSI  array  using  high-voltage 
transistor  inverters  instead  of  six-transistor  addressing  circuit  to  convert  TTL  signal  to  a  voltage 
of  IS  V.  The  results  will  be  presented  at  the  conference. 
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Figure  1.  Structure  of  Ugh- voltage  transistors. 


Figure  2.  Intensity  pattern  displayed  by  die 
high-voltage  VLSI/FLC  device. 
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ABSTRACT 

We  describe  Ac  selection  of  die  operating 
wavelength  of  field  effect  transistor  sdf- 
elecooopric  effect  devices  (FET-SEEDs)  in  onto  lo 
minimize  saturation  cffecti.M  well  as  to  optimize 
the  photonic  switching  circuit  figure  of  merit. 
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systems  Bring  FET-SEEDt  [1],  at  greater  than  100 
MHz  dock  rate.  These  smart  pods  incorporate 
multiple  quantum  wen  (MQW)  modulators  and 
field  effect  transistors.  Avenge  optical  powers  on 
individual  modulators,  in  excess  of  100  pW  may  be 
required  id  achieve  this  data  race,  leading  to 
sanntion  of  the  absorption  of  the  modulators.  It 
has  been  recently  recognized  that  reduced 
switching  energies  and  faster  switching  may  result 
from  using  short  pulses,  such  as  thorn  generated  by 
a  mode  locked  (ML)  laser,  instead  of  quasi- 
continaons  wave  (CW)  operation  [2],  The  use  of 
abort  pulses  may  compound  the  saturation 
problems  because  all  the  energy  is  supplied  before 
the  carriers  can  escape  from  the  wells.  We  define 
Xq  as  the  heavy  hole  wavelength  location  with  aero 
applied  field  and  Xj  as  the  shifted  heavy  hole 
wavelength  with  an  applied  field  determined  by  the 
number  of  quinine  wells  mid  the  applied 
modulator  voltage  V.  In  this  paper  we  compare 
CW  operation  at  Aq  and  A]  and  then  compare  the 
saturation  of  CW  (long  puire)  Vs  ML  (short  poise) 
operation  st  X]  so  as  to  minimize  saturation 
effects.  We  then  define  a  circuit  figure  of  merit,  to 
mintage  the  incident  optical  read  energy  required 
on  a  device  to  switch  another  down  stream  device. 

optiflifaw  thp  selection  of  oporstiB£ 
waveleng*  Xj  and  the  number  of  qnantnm  wells 
in  a  modulator  design. 
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The  absorption  coefficient  (arbitrary  units)  of 
an  AlQaAa  MQW  modulator  versus  wavelength  as 
measured  with  a  CW  tuneable  laser  at  low 
(unaatmated)  power  levels  is  shown  in  Pig.  1  and 
when  saturated  in  Kg.  2  at  two  different  voltages 
levels  [3].  This  demonstrates  that  a  modulator 
operating  st  Aq  is  degraded  by  saturation  much 
more  than  operating  at  A}  and  is  to  be  preferred 
since  (usability  is  not  required  with  FET-SEEDs. 

A  figure  of  merit  for  the  modulator  and  detector 
can  be  defined  by 

ftn  *  SI  (1  -  Raff)  (1) 
where  SR  is  the  difference  between  the  an  (high) 
sod  off  (low)  reflectivity  stales  at  A}.  In  Fig.  3  we 
plot  the  measured  Fm  versus  the  energy  in  the  ML 
pulse  which  is  the  product  of  avenge  power  with  a 
13  Ms  repetition  period  (76  MHz).  The  exciton. 

Aq,  was  drifted  to  843  nm  in  this  sample.  The  CW 
Reflection  curve  is  the  saturation  results  at  a  power 
level  such  that  the  energy  in  a  13  ps  period  is  the 
same  as  the  ML  experiment.  The  curves  in  Fig.  3 
show  the  difference  in  saturation  behavior  between 
the  situation  in  winch  the  energy  is  supplied  in  a 
period  much  greater  tfun  typical  carrier  sweep  out 
tunes  of  100  ps  (quasi  CW  case),  and  when  the 
energy  is  supplied  as  a  ML  (approximately  2  ps) 
pulre  much  shorier  than  sweep  out  times.  If  we 
speculate  on  «  smart  pixel  optical  processing 
system  operating  at  a  1j6  ns  repetition  period  with 
the  energy  inputted  in  a  25%  duty  cycle  of 400  ps, 
then  the  CW  saturation  conditions  should  be 
expected  to  prevail.  The  *0.4  ns  CW*  curve  shown 
is  a  shifting  of  the  CW  Reflection  curve  such  that 
the  same  amount  of  energy  is  integrated  in  400  ps 
to  be  compared  with  the  ML  measurement  For 
receiver  switching  energies  greater  dun  0.1  pJ,  the 
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modulator  read  power  should  be  (rater  than  0.5  pJ 
due  lo  array  to  array  coupling  losses,  tad  Fig.  3 
indicate!  that  there  will  be  some  saturation  penalty 
far  our  focaaed  beam  of  3.5  pm  diameter. 


Having  decided  on  operatkn,  it  is  useful  to 
determine  the  optimum  shift  6A,j  *  Aj  -Xq  and  the 
optimum  number  of  quantum  wells  N  each  of 
width  L,  (pm).  A  better  figure  of  merit  which 
includes  the  capacitance  of  the  MQW  detector 
which  effects  the  switching  energy  is 
FM(X1)«AnNLs«6R(l-Raff)NL(  (2) 
For  a  given  operating  voltage  V  one  can  caVralatft, 
using  our  previous  modeling  efforts  [4],  SR  and 
R^.  The  number  of  quantum  wells  N  is  adjusted 
at  each  A.  i,  as  shown  in  Fig.  4. 
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Fig.  4  Xj  (nm) 


FM(A|)  is  shown  in  Fig.  5,  at  various  fields  and 
the  corresponding  V  (voltages  VI  =  2.5  through 
V6  *  IS  volts  in  steps  of  2.5  volts).  From  these 
calculations  we  conclude  that  for  10  volt  operation 
across  the  modulators  that  SXj  =  7  nm  and  N  *  95 
maximirrs  the  figure  of  merit  FM.  Fig.  6  shows 
calculations  for  N  *  95  quantum  wells  of  Fm  (solid 
carve)  with  two  independent  experimental 
measurements  shown  as  die  broken  curves. 


Fig.  6  (nm) 
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Abstract  Photon-counting  measurements  of  carrier  sweep-out  in  biased  multiple  quantum  well 
p-i-n  diodes  are  presented .  These  results  are  compared  with  switching  characteristics  of 
symmetric -SEEDs,  which  were  measured  using  an  adapted  pump-probe  technique. 

Summary  A  key  component  of  several  opto-electronic  devices  (eg  modulators,  SEEDs, 
detectors,  etc.)  is  the  reverse-biased  p-i-n  multiple  quantum  well  (MQW)  structure.  A  funda¬ 
mental  limitation  on  die  speed  of  such  devices  is  the  carrier  sweep-out  time.  We  present 
measurements  and  analysis  of  the  carrier  dynamics  in  such  biased  modulators  using  picosecond 
time-resolved  phocoluminescence  techniques.  The  measurements  were  performed  with  an 
adapted  optical  microscope,  and  using  a  passively  Q~ switched  picosecond  laser  diode  as  the 
excitation  source.  A  small-area  silicon  single-photon  avalanche  diode  detector,  with  a  rise-time 
of  <20ps,  was  used  in  conjunction  with  the  time-correlated  single-photon  counting  technique. 
Key  features  of  this  approach  include:  (a)  die  high  spatial  resolution  possible  (>=0.5pm);  (b) 
the  detection  sensitivity  which  allows  analysis  of  carrier  densities  as  low  as  ~  1014cm'3;  and  (c) 
good  temporal  resolution,  permitting  decay  times  as  short  as  lOps  to  be  measured.  Figure  1 
shows  PL  decay  2  from  a  GaAs/AlGaAs  multiple  quantum  well  modulator  as  a  function  of  reverse 
bias  electric  field .  Numerical  modelling  of  these  decays  which  take  account  of  carrier  drift. 
Coulomb  screening,  tunnelling  and  thermionic  emission,  will  be  presented.  A  discussion  of  the 
implications  of  such  results  on  electro- absorption  saturation  will  also  be  given. 

A  notable  device  which  utilises  two  such  MQW  p-i-n  diodes  connected  in  series  is  the 
Symmetric-SEED  (S-SEED).  The  switching  speed  of  this  is  limited  by  a  combination  of  RC 
electrical  time  constants  and  die  above  MQW  carrier  transport  effects.  By  using  an  adapted 
pump-probe  technique,  the  single-photon  detection  system  was  used  to  probe  the  reflection  from 
one  window  of  an  S-SEED  and  hence  measure  its  switch  speed.  Such  a  detection  system  meant 
that  switching  times  as  short  as  ~20ps  could  be  resolved  using  this  instrument  Figure  2  shows 
an  example  of  the  dynamic  switching  characteristics  of  an  S-SEED  at  three  different  bias 
voltages.  Discussions  on  the  instrumental  performance  (eg  temporal  dynamic  range,  spatial 
resolution,  etc.)  will  be  presented,  as  well  as  the  application  of  such  techniques  to  other  SEED 
derivatives,  such  as  FET-SEEDs.  The  relevance  of  the  above  TRPL  measurements  to  S-SEED 
dynamics  will  also  be  discussed. 
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Fig.  1  Time-resolved  pbotduminescence  decays  of  a  GaAs/AlGaAs  MQW  modu¬ 
lator  under  different  bias  conditions. 
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Fig.  2  Time-resolved  switching  characteristic  of  an  S-SEED,  at  applied  reverse  bias 
of  25V,  15V  and  20V. 
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Abstract 

We  will  discuss  a  novel  high  speed  smart  pixel  structure  which  integrates  a  high 
electron  mobility  transistor  with  an  asymmetric  Fabry-Perot  optical  modulator.  Its  use  in 
an  optoelectronic  amplifer  circuit  will  be  also  described. 

Summary 

Monolithic  integration  of  electronic  components  with  optical  elements  provides  a 
means  whereby  practical  optoelectronic  data  processing  may  become  a  reality. 

Following  recent  interest  in  smart  pixels  [1, 2,3,4]  we  have  developed  a  novel  wafer 
structure  (figure  1)  which  combines  an  AlGaAs  p-i-n  detector /modulator  structure  and  a 
quantum-well  high  electron  mobility  transistor  (QW-HEMT)  structure. 

The  complete  configuration  is  that  of  a  modified  asymmetric  Fabry-Perot  modulator 
(AFPM)  allowing  high  contrast  reflection  modulation  and  efficient  photodetection.  We 
grow  the  QW-HEMT  layers  inside  the  AFPM  cavity  on  top  of  an  undoped  back  mirror. 
This  design  avoids  the  presence  of  a  p-n  junction  under  the  QW-HEMT  which  would 
otherwise  result  in  backgating  [1].  Backgating  reduces  the  output  optical  contrast  ratio 
and  gives  rise  to  a  gradual  switching  action.  It  arises  when  a  continuous  p-layer  under 
more  than  one  n-channel  device  is  held  at  a  fixed  voltage.  A  more  complex  solution 
involves  the  insulation  of  selected  areas  of  this  layer  with  proton  bombardment  and 
contact  formation  with  Be  implantation  [3]. 

The  QW-HEMT  contacts  the  quantum  well  region  directly.  A  deep  etch  uncovers  the 
QW-HEMT  surface  for  processing.  This  leaves  the  p-i-n  detectors  and  modulators  as 
mesas.  The  structure  is  flexible  in  that  it  permits  etching  of  the  front  surface  in  order  to 
adjust  the  optical  resonance.  Alternatively  it  would  be  possible  to  grow  an  additional 
front  mirror  to  increase  the  contrast  ratio  of  the  output  modulation.  This  approach  also 
facilitates  the  optimisation  of  the  high  speed  performance  of  the  AFPM  [5].  This,  married 
with  the  high  bandwidth  offered  by  the  QW-HEMT,  holds  great  potential  for  high  speed 
smart  pixels. 

The  fully  integrated  circuit  in  figure  2  demonstrates  the  basic  utility  of  this  structure. 
Both  the  detector  and  the  modulator  are  formed  from  the  same  wafer  layers.  The  QW- 
HEMT  structure  is  designed  to  be  normally-on  (depletion  mode).  The  circuit  accepts  an 
optical  input  beam  via  a  p-i-n  diode.  In  the  configuration  shown  the  diode  acts  as  a 
capacitor  being  charged  by  the  photocurrent.  The  upper  QW-HEMT  is  connected  as  an 
active  load.  The  charge  developed  in  the  diode  modulates  the  gate  voltage  of  the  lower 
QW-HEMT.  This  allows  the  gate  voltage  change  to  induce  a  larger  output  voltage  shift 
which  is  applied  to  the  underside  of  the  output  AFPM  thereby  controlling  its  reflectance. 
The  circuit  may  thus  function  as  an  optical  amplifier  since  a  low  power  input  may 
control  the  modulation  of  a  higher  power  optical  output. 
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Abstract 

We  demonstrate  a  novel  all-optical  Quantum  Confined  Stark  modulator.  The 
necessary  electric  field  is  switched-on  optically,  by  exploiting  strongly  asymmetric 
carrier  transfer  in  GaAs/AlAs  layers.  In  an  unoptimized  sample,  we  measured  9 
meV  exdton  redshift  at  850nm  with  optical  excitation  of  only  a  few  W /cm2  of 
He-Ne  laser  at  633  nm. 


Summary 

In  this  report,  we  present  preliminary  results  on  a  light  modulator  based  on 
the  Quantum  Confined  Stark  Effect  (QCSE)  [1]  but  for  the  first  time  operating  all- 
optically.  The  principle  of  operation  of  the  device  is  as  follows:  every  period  of  the 
heterostructure  contains  three  quantum  wells  (QWs)  designed  in  such  a  way  that 
following  above  bandgap  photoexcitation  a  large  fraction  of  the  photogenerated 
electrons  and  holes  tend  to  separate  and  accumulate  in  the  exterior  QWs,  creating 
a  local  space-charge  field  having  its  maximum  in  the  region  in  between  and  acting 
via  the  QCSE  on  the  exdton  resonance  of  the  central  QW. 

In  Figure  1,  we  show  a  schematic  band  diagram  of  one  period  of  the  hete- 
rostructure.  The  three  QWs  are  enumerated  from  left  to  right.  QW1  is  »  25A  of 
GaAs,  surrounded  by  a  graded  AlGaAs  (x=36  —  42%,  800A)  and  an  AlAs  barrier 
(100A).  The  combination  of  a  narrow  GaAs  QW  adjacent  to  an  AlAs  layer  is  the 
key  aspect  of  the  device.  Following  photoexdtation  resonant  with  the  AlGaAs 
barriers  or  QWl  (Xwriu  «  620nm  or  700nro,  respectively),  this  layer  combination 
functions  as  a  one-way  "quantum  filter”  for  the  photocarriers  of  QWl,  by  blocking 
the  holes  out  but  allowing  the  electrons  transfer  rapidly  into  QW2.  This  is  due 
to  the  fact  that,  for  suffidently  thin  GaAs  layer  thicknesses,  the  electron  energy 
level  at  the  T-point  in  the  GaAs  QW  is  higher  than  the  one  at  the  X-point  in 
the  AlAs  layer.  T— X  electron  transfer  times  in  the  subpicosecond  regime  have 
been  measured  in  thin  GaAs/AlAs  superlattices  [2].  QW2  is  150 A  of  GaAs  and 
QW3  100A  of  InGaAs  (x=10%).  The  electrons  that  pass  through  the  quantum 
filter  leaving  behind  holes,  are  able  to  subsequently  tunnel  through  a  thin  (<  50 A) 
AlGaAs  barrier  into  QW3.  The  resulting  space-charge  electric  field  acts  upon  the 
exdtonic  resonance  of  QW2  (Areaj  «  850nm). 

In  Figure  2,  we  show  the  transmission  spectra  of  one  of  our  samples  in 
the  region  of  the  QW2  exdton,  with  and  without  photoexdtation  by  1W /cm2  of 
He-Ne  laser  operating  at  633nm.  The  distinct  heavy  (HH)  and  light  hole  (LH) 
exdton  features  dearly  redshift  under  the  effect  of  illumination.  Redshifts  up  to 
9  meV  were  registered  with  a  few  W /cm2  of  optical  exdtation.  This  corresponds 


WPSO/l^O 


to  an  effective  electric  field  of  nearly  30  KV/cm.  Optimization  of  our  structure 
parameters  is  under  way. 

Presently:^)  Max-Planck- Institute,  Heisenbergstr.l,  70569  Stuttgart,  Germany. 
W  Ecole  Polytechnique  Federale  Lausanne,  CH  1015  Lausanne,  Switzerland. 

[1]  Miller  et  al,  PRB  32,  1043  (1985)  [2]  Feldmann  et  al,  PRB  42,  5809  (1990). 
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Figure  1:  Schematic  band  diagram 
of  one  period  of  the  heterostructure 
and  photogeneration  of  electric  field  by 
rapid  electron  transfer  from  QWl  to 
QW3.  The  QW  energy  levels  at  the  T 
point  are  denoted  by  full  lines  whereas 
the  X  point  energy  level  in  the  AlAs 
layer  by  a  dotted  line. 


Figure  2:  Redshift  of  the  QW2  heavy 
(HH)  and  light  (LH)  hole  exdtons  ob¬ 
served  in  room  temperature  transmis¬ 
sion  spectra,  with  (HeNe)  or  without 
(Dark)  photoexdtation  by  1W /cm2  of 
HeNe  laser. 
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ABSTRACT 

We  present  the  sensitivity  transition  characteristics  of  LAPS-SLM. 
Furthermore,  optical  neural  network  that  is  using  the  LAPS-SLM  as  a 
thresholding  device  will  be  presented. 

Utilizing  these  characteristics  to  the  optical  neural  network,  the  memory 
effect  similar  to  the  brain  is  realized. 

SUMMARY 

We  present  the  sensitivity  transition  characteristics  of  LAPS-SLM,  which  is 
incorporating  a  hydrogenated  amorphous  silicon  photoconductor  and 
ferroelectric  liquid  crystal  modulators,  and  its  application  for  the  optical 
neural  network. 

In  the  past,  we  reported  the  gray  scale  expression  characteristics  of  the 
LAPS-SLM  in  SPIE  Electronic  Imaging  Science  and  Technology  '91, 
moreover  we  reported  the  two  threshold  characteristics  of  LAPS-SLM  and 
its  application  for  digital  optical  processing  in  SPIE's  1992  International 
Symposium  on  Optical  Applied  Science  and  Technology. (1)  (2)  (3)  (4)  (5) 

This  time,  we  evaluated  the  sensitivity  transition  characteristics  of  LAPS- 
SLM.  Fig.  1  shows  the  relation  between  write  light  intensity  and  sensitivity 
transition.  Sensitivity  transition  of  LAPS-SLM  is  occurred  by  continuous 
writing,  and  is  controllable  by  write,  read  or  bias  light  intensity. 
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Sensitivity  transition  is  also  controllable  by  driving  voltage,  pulse  width, 
and  writing  or  neglecting  time.  Control  range  of  sensitivity  transition  is  from 
10%  to  40%  on  threshold  intensity  of  write  light. 

Fig.  2  shows  the  relaxation  after  10  minutes  continuous  action. 


Fig.  2 :  Relaxation  after  10  minutes  continuous  action 

These  characteristics  realize  the  memory  effect  similar  to  the  brain  as  it 
was  applied  to  the  optical  neural  network.  If  the  LAPS-SLM  is  used  as  a 
thresholding  device  in  an  optical  neural  network  system,  lower  of  threshold 
light  intensity  was  namely  lower  of  neuron's  excitement  threshold  level. 
Therefore  the  accumulation  effect  shown  in  Fig.1  corresponds  to  the 
memorization  in  the  function  of  brain,  because  the  repeat  of  writing  gives 
the  lower  of  threshold  light  intensity.  Similarly,  the  relaxation  effect  shown 
in  Fig.2  corresponds  to  lose  of  memory  in  the  function  of  the  brain. 

These  functions  are  realized  in  parallel  by  purely  optical  means  using 
LAPS-SLM.  LAPS-SLM  also  exhibits  higher  performance  in  the  system  as 
resolution  (over  100  Ip/mm,  max.  380  Ip/mm),  contrast  ratio  (over  400:1)  and 
frame  rate  (over  1  KHz).  Therefore,  performance  of  the  optical  neural 
network  becomes  very  high. 
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ABSTRACT 

We  quantify  die  nonuni formity  tolerance  of  FET-SEED  circuit  arrays,  show  its  dependence  on  optical  output 
contrast  and  speed,  and  describe  a  single-ended  mode  of  operation  whose  non  uniformity  tolerance  compares  well 
with  that  of  differential  operation. 


Although  a  particular  kind  of  FET-SEED  Smart  Pixel  circuit  may  have  acceptable  performance  for  a  given 
application,  one  cannot  assume  immediately  that  all  the  Smart  Pixels  in  a  large  array  can  exhibit  the  same 
performance  in  a  system.  Neither  the  circuits  nor  their  concomitant  optical  beams  can  be  identical  even  if 
to  be  so.  There  will  be  variations  or  non  uniformities  due  to  many  different  causes.  Thus  the  circuit  and  system 
designs  should  be  such  as  to,  first  of  all,  minimize  possible  variations,  then  to  minimize  the  effect  of  remaining 
variations,  and  finally,  failing  that,  to  be  tolerant  to  the  lingering  effects  of  nonuniformiries,  so  that  all  pixels  operate 
properly  under  one  global  set  of  conditions.  In  recent  years,  as  sources  of  non  uniformity  were  identified,  iterative 
redesign  has  significantly  reduced  the  size  of  variations  which  occur  in  FET-SEED  circuits  and  die  »««nriatwi 
system  hardware  [1,2].  The  use  of  ‘dynamic  latching”  greatly  dissipates  the  effects  of  many  of  the  nonuniformiries 
not  eliminated  [3,4].  The  remaining  variations  can  be  analytically  dealt  with  as  if  due  solely  to  differences  in  the 
amount  of  optical  power  in  die  beams  incident  on  the  modulators  in  the  FET-SEED  circuits,  hereafter  called  the 
‘read  beams*.  The  principal  issue  is  to  guarantee  that  the  two  possible  digital  signals  (logical  ”1*  and  ”0‘)  are 
correctly  recognized  as  such,  as  data  is  optically  transmitted  from  pixel  to  pixel,  despite  the  fret  that  the  read  beam 
powers,  Pr,  are  not  all  equal.  Recall  that  data  is  optically  transmitted  by  a  FET-SEED  circuit  when  a  read  beam 
impinges  on  an  output  modulator,  that  is  either  in  a  high  reflectivity  state,  Rm,  far  one  logic  state,  or  a  low 
reflectivity  stale.  Rlo,  for  the  other  logic  state.  The  output  signals  are  then  either  RMPr  or  Ru>Pr-  Their  ratio. 
Kq  ■  R«/Rlo.  often  called  the  modulator  output  contrast 

A  mode  of  FET-SEED  circuit  operation,  where  optical  data  encoding  is  done  with  transmitters  which  use  only 
one  modulator  is  called  single-ended.  If  Rlo  is  not  zero,  and  Kq  is  finite  (as  is  usually  the  case  for  SEED 
modulators)  then  one  can  intuitively  see  that  for  single-ended  operation,  a  nommifonnity  problem  can  arise  when 
P*  *  P4.  where  ij  denote  the  ith,  jth  pixel  in  an  array.  For  instance,  when  Pri  »  P^,  then  either  RloPb  becomes 
too  large  far  one  logic  stale  or  RmP,  becomes  too  small  for  the  other  logic  state,  as  one  tries  to  globally  adjust  all 
the  P,  powers  together.  To  date,  this  problem  has  usually  been  dealt  with  in  systems  using  FET-SEED  circuit  arrays 
by  operating  in  a  mode  where  the  data  encoding  is  differential.  Each  transmitter  has  two  modulators,  configured  to 
settle  into  complementary  states,  and  receives  two  read  beams,  Pni  and  P^.  The  two  digital  states  are  thus 
represented  by  complementary  signal  pairs  (RinPrii,  Ru>P«»)  and  (RloP»i.  RfflPh2)-  The  optical  receivers, 
already  having  dynamic  latching  far  its  benefits,  are  then  designed  to  respond  to  the  difference  in  power  in  a  signal 
pair,  so  that  the  two  logic  states  are  registered  by  (R»Pai  -  Rlo  Friz)  or  (RujP.ii  -  RaP**)-  If  there  is  local 
uniformity  (where  Prii  =  P,a  =  P»,  P«g  =  P$2  =  Pq)  and  only  global  non-uniformity  (P„  *  Pq),  then  in 
differential  operation  the  digital  signals  become  Pn(AR)  or  Pri(-AR);  P^(AR)  or  P^(-AR);  etc.,  where 
AR  =  Rjn  -  Rlo-  Then,  once  all  Pt  are  made  large  enough,  all  digital  signals  will  be  correctly  recognized  even  if 
Pa  »  P«j  or  vice-versa.  Thus  differential  operation  is  generally  considered  much  more  nonumformity  tolerant  than 
single-ended  operation  and  essentially  dependent  on  AR  rather  than  Ko. 

Some  of  the  assumptions  underlying  the  above  assertion  do  not  hold  steadfastly  under  close  scrutiny.  The  two 
principal  ones  are  that  there  is  local  uniformity  (Phl  =  Pn2,  etc.)  and  that  the  P/s  can  be  made  as  large  as 
necessary.  Recent  measurements  of  beam  powers  in  a  real  system  [2]  showed  that,  effectively,  the  local  and  global 
variations,  as  defined  above,  are  about  the  same.  Saturation  and  heating  effects  in  the  modulators  as  well  as  limited 
laser  source  power,  for  read  beams,  conspire  to  set  an  upper  limit  on  the  P/s.  The  consequences  of  these  facts  can 
be  quantified  with  the  following  analysis. 

When  ?  FET-SEED  receiver  has  a  switching  energy,  E,w,  to  deliver  E,.  through  a  transmitter,  a  source  laser 
must  tanmdi  an  energy,  E«.  There  two  energies  can  be  related  by  an  operation  mode’s  figure  of  merit,  (FM),  by  the 


E„  =  (FM)E«  . 


(1) 


(PM)  is  die  efficiency  with  which  launched  energy  is  converted  to  switching  energy.  This  (FM)  is  reducible  to  the 
one  discussed  in  [5].  For  differential  operation  (FM)d  is  given  by: 


(FM),  = 


q1o^(RHiPiii  ~  RboPm) 

2P r  : 


(2) 


where  Pf  is  the  avenge  laser  lunched  power  per  beam,  ctj  is  the  attenuation  factor,  due  to  system  optics,  between 
die  laser  and  a  transmitter’s  modulator  aid  is  the  attenuation  factor,  due  to  system  optics,  between  transmitters 
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Let  to  beam  powers  vary  such  that  over  N  pixels,  (l-x)P«  s  P«,  S  (l  +  x)P«  ,  (3) 

whore  x  is  a  fraction  £l.i  «  (1,  2,  •••N)aada  *  1,2. 

Alao  define  be  acmmifonnity  tolerance  margin,  M,  by  M  ■  (I+x)/(l-x) .  (4) 

(Note  that  Mbdwmio  of  the  largest  over  the  smaBest  bean  power.)  Then,  for  the  wont  caae,  which  governs  the 

array  performance,  (FM)*  =  (1-M/Kq)  .  (5) 

Thus  even  far  differential  operation,  the  contrast  ratio,  K«,  is  important,  as  was  pointed  out  before  in  [6].  Enor-fiee 
performance  is  not  possible  unless  Ko  >  M,  and  then  (FM)*  remains  small  until  Kg  »  M  (or  at  least  Ko  2  3M). 
A  very  small  (FM)j  means  a  very  large  E<  is  needed,  which  implies  a  much  reduced  data  rate,  f,  or  the  need  for  very 
high  power  laaer  beams,  since  P(  *  OB*.  In  a  system  application,  there  will  be  an  upper  available  limit  on  P*  and  a 
lowest  acceptable  limit  on  f,  setting  sn  upper  limit  on  E*  mid  a  lower  limit  on  (FM)*.  Without  a  sufficiently  large 
Kq.  the  allowable  x  and  M  became  small  and  the  nononiformity  tolerance  of  differential  operation  evaporates.  A 
similar  analysis  shows  that  standard  methods  of  single-ended  operation  fare  even  worse.  Is  there  an  alternative? 

The  answer  is  possibly  yes,  in  an  unusual  form  of  single-ended  operation,  we  call  Asynchronous  Reset  On  Every 
Bit,  or  AROEB-ic  for  short  The  operatmg  procedure  requires  pulsed  optical  dab  and  dynamically  latching 
receivers  with  a  sharp  switching  threshold.  (We  will  show  experimental  dab).  Of  the  two  types  of  digital  logic 
signal  (logical  1  or  0)  pubes,  one  should  deliver  E,*  and  the  other  no  mate  than  a  threshold  energy,  E*.  which 
causes  no  switching.  Interleaved  in  time  between  the  signal  pulses  are  informationless  reset  pulses  which  return  the 
receiver  to  a  preset  state.  Being  asynchronous  with  signal  pubes,  the  reset  pulses  create  no  contention  with  the 
signal  pubes.  This  fact  allows  the  signals  to  be  smaller  than  in  other  operating  modes,  mid  removes  the  upper  bound 
from  the  reset  pulses  that  reference  signals  would  have.  The  receiver  then  has  a  required  input  contrast  ratio.  Km, 


obviously  i  1.  such  that  Km  *  E^/E* .  (6) 

The  AROEB-ic  mode  of  operation  has  two  figures  of  merit,  (FM),t  and  (FM)^  such  that: 

s  (FM).,E«  and  E*  2  (FM),jE*  .  (7) 

For  the  worst  cases,  which  govern  array  performance, 

(FM).,  -  a,a2(l-x)RH]  and  (FM),2  =  a,o2(l+x)Ru> .  (8) 

Error  free  performance  is  not  possible  unless,  (from  eqs.  6, 7, 8) 

(FM^/FM.j  2  Ku  Le.  Kq  2  MK*,  ,  (9) 


and  (FMXu  satisfies  the  constraints  imposed  on  E*  by  P*  and  f.  Nob  that  (FMXi  exceeds  (FM)*  by  a  factor  of 
2/(  I  -  M/Kq  ).  For  receivers  with  K*]  ■  2  (which  should  not  be  impossfale),  and  with  a  not  very  large  Kq  of  4,  a 
very  respectable  M*2  is  allowed  and  (FM),i  *  4(FM)*.  So,  such  an  AROEB-ic  operation  would  be  four  fanes 
faster  than  ?  differential  mode  or  alternatively  could  be  done  with  four  times  less  laser  power.  Rom  a  different 
perspective,  joe  can  abo  deduce  that  when  a  system  application  demands  a  high  dab  rate  with  limited  laser  power, 
the  dngb  rndrd  ABOEB-k  mode  of  operation  cm  have  greater  nouunjformity  tolerance  than  the  differential 
mode  of  operation.  On  the  other  band,  with  a  very  relaxed  dab  rale  requirement,  the  differential  mode  will  work 
with  a  lower  Ko.  by  a  factor  of  K«,  than  the  AROEB-ic  mode  would. 

In  summary,  we  present  a  simple  method  of  analysis  which  quantifies  non  uniformity  tolerance  in  FET-SEED 
circuit  arrays  by  relating  it  to  easily  measurable  parameters,  so  that  reasoned  choices  can  be  made  when  such  arrays 
are  mcorponted  into  systems. 
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Abstract 

The  design  and  implementation  of  optically  interconnected  self-routing  exchange/bypass 
modules  arrays  is  described.  The  nodes  are  laid  out  using  silicon  and  gallium  arsenide  based 
technologies.  Performance  metrics  of  the  pixels  are  analysed. 

Motivation 

Sorting  remains  one  of  the  most  commonly  performed  tasks  in  any  computation  [1].  The 
parallel  implementation  of  this  function  using  space-variant  non-local  optical  interconnects  such 
as  the  perfect  shuffle  [2]  has  been  shown  to  provide  a  bonus  factor  compared  to  its  electronic 
counterpart  [3].  This  bonus  factor  could  be  increased  by  two  orders  of  magnitude  if  dedicated 
hardware  (smart  pixel)  is  provided  to  reduce  the  computational  part  of  the  sorting  [3].  Con¬ 
sequently  self-routing  exchange/bypass  modules  have  been  designed  and  laid  out  using  1.5  pm 
CMOS  and  AT&T  FET-SEED  monolithic  integration  [4].  The  smart  pixels  arrays  will  be  used 
in  a  time  multiplexed  multi-stage  interconnection  network  (MIN)  [5]  in  which  the  bitonic  sort  is 
implemented  with  perfect  shuffle  interconnects  [6]. 

The  enhanced  excht  aa  c/bypass  module  . 

The  smart  pixel  is  designed  to  latch  at  the  first  difference  in  the  significant  bit  of  the  two 
optical  input  data  streams  A  and  B.  It  remains  in  the  state  set  until  a  electrical  reset  signal  R 
is  activated  at  the  end  of  the  word.  This  ensures  that  the  output  signal  C  exhibits  either  the 
maximum  or  the  minimum  of  the  two  input  signals  depending  on  the  state  of  the  control  signal 
P.  An  additional  signal  S  may  be  activated  to  allow  the  output  data  C  and  D  to  be  the  "straight 
through”  inputs  at  certain  stages  of  the  sorting.  The  module  is  docked  by  the  signal  CLK.  The 
logical  design  of  this  circuit  is  shown  in  figure  1.  It  is  intended  to  add  to  the  logic  drcuitry  a 
storage  unit  which  consists  of  a  4-bit  shift  register. 

The  module  is  implemented  in  European  Silicon  Structures  (ES2)  1.5  pm  CMOS,  n-well  dou¬ 
ble  metal  technology  via  Eurochip  (similar  to  US  MOSIS).  This  allows  the  design  to  be  debugged 
and  the  process  to  be  demonstrated  economically.  The  layout  of  4x4  array  of  pixels  is  shown  in 
figure  2.  The  optical  interconnects  come  into  the  silicon  chip  via  metal  pads  from  a  GaAs  chip 
which  has  the  transmitters  and  receivers.  The  two  chips  will  be  flip-chip  bonded  together  [7]. 
Assuming  that  the  common  dock  and  reset  signals  can  be  electrically  connected  and  that  dual 
rail  logic  is  implemented  ten  optical  input  signals  per  pixel  are  hence  to  be  considered.  A  similar 
design  using  FET-SEED  monolithic  integration  technology  is  currently  under  study.  Electrical 
characteristics  of  a  4x4  array  of  such  pixels  will  be  presented. 
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Abstract:  The  maximum  optical  Fm/P^  ratio  of  phototnmsistor/surface  emitting  laser  smart  pixel  logic  gates 
has  been  analysed  in  terms  of  device  and  circuit  parameters.  Cascading  gate  experiments  have  been  performed. 


Heterqjunction  phototran si  stor/ vertical  cavity 
aurinoe  ■™**«wg  laaer  (HFT/VCSEL)  smart  pixel 
arrays  hold  gnat  promise  for  parallel  optoelectronic 
processing  systems  and  other  applications  [1-3].  The 
implementation  of  many  of  these  systems  requires 
beam  splitters,  holograms  end  lenses.  These  passive 
elements,  along  with  misalignment  and  beam  spread¬ 
ing,  greatly  reduce  die  magnitude  of  the  optical 
signals  as  they  propagate  from  one  stage  to  another. 
To  overcome  these  losses  and  insure  that  die  arrays 
are  caacadable,  the  HPT/VCSEL  pixels  must  have 
sufficient  optical  gain.  Unfortunately,  die  optical 
output  power  of  a  VCSEL  is  not  a  continuous  linear 
function  of  the  currant  above  threshold,  hut  rather 
reaches  a  maximum  and  then  decreases  rapidly  as  die 
drive  currant  is  increased  further.  Therefore,  logic 
gate  pixels  formed  with  VCSELs  and  either  HFTs  or 
FETs  will  have  a  maximum  gate  gain  (defined  as  the 
maximum  P^/P).  ratio)  that  depends  upon  the  optoclo- 
ctronic  gain  of  die  transistors  and  die  characteristics 
of  die  VCSELs. 

This  paper  presents  an  analysis  of  die  maxi¬ 
mum  gate  gain  of  HFT/VCSEL  logic  gates  and  the 
variation  of  this  gain  with  translator  optoelectronic 
gain  (G^),  maximum  VCSEL  power  out  (PJ, 

threshold  currant  (I*)  and  I_  (the  current  at  P _ V 

We  have  applied  load  line  analysis  and  nmnurinai 
simulation  to  determine  the  gate  gain  of  inverting, 
e.g.  NOR  and  NAND,  and  non-inverting,  e.g.  AND 
and  XOR  logic  gates  for  various  device  parameters 
(note:  the  XOR  gate  has  both  inverting  and  non- 
inverting  components),  hi  addition,  we  have  experi¬ 
mentally  demonstrated  three  cascaded  HFT/VCSEL 
stagee  and  have  set  up  a  demonstration  of  an  AND 
gate  driving  an  XOR  gate.  The  results  of  both  die 
analysis  and  experiments  will  be  presented  in  die 
paper.  Figures  la  and  lb  illustrate  die  common 
emitter  I.  vs.  V.  characteristics  of  an  HPT  with  three 
fuper-impoeed  VCSEL  load  lines,  for  inverting  end 
non-inverting  logic  gates  respectively.  Figures  2a  and 
2b  dhow  the  PM  vs.  Ph  curves  resulting  from  these 
load  lines.  These  plots  demonstrate  dud  there  is  an 
optimum  supply  voltage  required  in  order  to  achieve 
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Figure  1.  a)  Load-line  curve  for  non-inverting  gate, 
b)  Loed-line  for  inverting  gate. 
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in  Figs.  3  and  4.  Time  figure*  flfattrate  the  effect  of 
VCSEL  thrwhokl  current  on  the  gate  gain  for  the  two 
VCSEL  designs  end  «  HPT  optoelectronic  gam  of 
*  50A/W.  it  is  seen  that  dm  gate  gain  is  only  a 
weak  (taction  of  ^  and  Ana  small  changes  in  1* 
across  an  array  should  not  effect  the  over-ell  perfor¬ 
mance.  However,  it  is  also  deer  from  this  figure  that 
reducing  I*  of  the  low  efficiency  VCSEL  does  not 
significantly  increase  die  gate  gam.  Thus,  while 
reducing  1*  decreases  the  power  dissipation,  it  does 
not  increase  An  cascadabQity  of  the  array.  Even  with 
the  more  efficient  VCSEL,  die  gate  gain  does  not 
increase  dramatically  until  ^  becomes  very  small. 
S—nary;  The  gaia  of  optoelectronic  logic  pixels  has 
been  shown  to  critically  depend  npon  the  bias  applied 
to  the  circuit  and  to  be  proportional  to  the  gain  of  the 

VCSEL*.  Ttegme^wwfamdCbnntyawedc 
(taction  of  dm  VCSEL  tawhoid  bat  is  strongly 
dependent  spon  ^ -I*,  the  efficiency  of  tbe  VCSELs. 
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Figured.  Gate  gain  vs  threshold  current  for  inverting 

gates. 
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Photothyristor  smart  pixels  in  M.I.M.D. 
optical  computing  architectures 
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Vrije  Univcrateit  Brussel,  Applied  Phys.  Dept 
Pleinlaan  2,  B-1050  Brussels,  Belgium 

We  describe  the  application  of  vertical-to-surface  transition  electrophotonic  devices  to  Multiple 
Instruction,  Multiple  Data  parallel  optical  information  processing  and  in  particular  the 
development  of  smart  pixels  based  on  photothyristors  -  the  basic  elements  for  MJMD. 
machines. 

Recently  a  new  and  very  promising  generation  of  active  bistable  vertical-to-surface  transition 
electrophotonic  devices  was  developed  to  perform  chip-to-chip  optical  interconnects  -  the  PnpN 
photothyristors.  When  used  in  die  configuration  of  the  differential  pair,  these  multilayer  HI  V 
semiconductor  (GaAs)  structures  have  been  shown  to  be  capable  of  emitting  binary  optical 
signals,  receiving  50  fJ  optical  inputs  together  with  a  set-on-set-off-cycle  time  of  10  ns  [lj. 

Malting  use  of  these  exceptional  optical  signal  receiving  and  emitting  abilities,  we  will  present  a 
method  in  five  distinct  steps  to  extend  the  functionality  of  the  differential  pair  of  optical 
thyristors  with  a  Si  integrated  electronic  control  cell,  which  converts  the  incoming  optical 
signals  into  TTL-  or  CMOS-  compatible  electronic  signals  and  vice  versa.  If  digital  logic  and 
memory  is  implemented  inside  each  of  these  control  cells,  we  obtain  smart  pixels  for  Multiple 
Instruction  Multiple  Data  optoelectronic  processors,  where  different  operations  can  be  executed 
on  different  pixels  at  the  same  time. 

The  first  step  consists  of  applying  a  common  voltage  sequence  to  die  whole  array  of  differential 
pairs  of  optical  thyristors.  Dining  the  receiving  level  (0  V),  optical  inputs  create  charge  carriers 
in  the  central  junctions  of  one  or  both  thyristors  of  die  differential  pair.  At  set-on  level  (e.g.  +6 
V)  the  decision  is  made  whether  the  first  (logic  'l1)  or  the  second  (logic  V)  thyristor  received 
the  largest  optical  input  -  this  one  switches  on  while  the  other  remains  off.  At  set-off  level  (e.g. 
-10  V),  charge  carriers  are  extracted  from  both  central  junctions  by  punch-through. 

In  step  2  the  differential  pair  achieves  the  conversion  of  electronic  to  optical  signals  and  vice 
versa  is  achieved  in  step  2.  The  first  operation  requires  that  an  electronic  control  cell  is  able  to 
select  which  of  both  thyristors  is  switched  on  at  set-on  level  (independently  of  any  optical 
input),  the  latter  to  determine  which  of  both  thyristors  is  on  after  having  received  an  optical 
input  We  present  several  solutions  to  these;  problems;  one  of  these  is  the  use  of  GaAs 
transistors  parallel  to  each  thyristor,  decreasing  the  amount  of  charge  carriers  in  this  thyristor 
(electronic  to  optical  data  conversion),  together  with  a  Si  integrated  voltage  comparator  to 
determine  which  of  both  is  on  (optical  to  electronic  data  conversion).  The  GaAs  and  Si  chips 
are  electrically  tinked  to  each  other  by  adder  bumping. 

Step  3  is  the  synchronisation  of  die  common  voltage  sequences  erf  different  arrays  in  the 
system,  making  use  of  a  system  of  alternating  light  beams  to  transmit  system  clock  pulses.  At 
step  4  we  propose  an  elementary  Si-integrated  control  module  (individual  to  each  differential 
pair)  to  translate  the  set  of  required  and  measured  voltages  (at  times  given  by  the  local 
synchronization  circuit  of  the  array)  into  the  higher-level  command  lines  DATA,  SEND  and 
CLOCK.  Finally,  step  5  is  the  implementation  of  Si-integrated  digital  logic  and  memory, 
allowing  the  smart  pixels  to  exchange  and  store  information  at  application  level,  including 
shifting,  pattern  recognition,  or  general  purpose  M.LM.D.  computing. 

A  promising  approach  to  M.I.MJD.  computing  is  the  Tse  concept  [2],  which  provides  a  simple 
way  to  make  several  optoelectronic  processing  units  (several  logic  planes  of  smart  pixels)  work 
together  to  implement  the  intricate  M.I.MJD.  processing  task.  Tse'  is  the  Chinese  word  for 
'icon'  or  'small  image  element'.  And  indeed,  the  binary  images  that  can  be  processed  by  this 
machine  are  divided  into  image  elements  of  e.g.  4  bits  each.  An  instruction  image,  containing 
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the  different  4  bit  instructions  X  (e.g.  X  *  AND,  NOT, ...)  to  be  executed  on  the  different 
image  elements,  and  one  or  several  data  images  are  applied  to  all  processing  units.  If  the 
processing  unit  X  recognises  die  instruction  X  in  some  image  elements  of  the  instruction  image, 
the  instruction  X  is  executed  cm  corresponding  image  elements  in  the  data  images.  The  result 
image  is  obtained  by  combining  (OR-tng)  the  partial  results  of  the  different  processing  units 
X.YJZ,...  which  can  perform  logic  operations  as  well  as  shifting  operations,  allowing  data  to  be 
transferred  between  different  image  elements  of  a  same  image.  The  global  aim  of  these 
principles  is  to  execute  different  programs  on  different  image  elements  while  letting  these 
programs  exchange  results. 

Compatibility  of  the  optoelectronic  Tse  M.I.M.D.  concept  with  die  classic  S.I.S.D.  operation  of 
an  electronic  personal  computer  can  be  achieved  if  the  format  of  the  image  element  is  chosen 
according  to  the  registers  of  the  CPU  of  the  computer  under  consideration.  A  single  image 
element  could  then  consist  of  an  instruction  field,  data  fields,  the  status  register,  data  and 
address  registers.  A  parallel  memory  plane  (an  array  of  smart  pixels  with  memory)  is  used  to 
deliver  the  image  with  the  instructions  in  the  instruction  fields.  Each  processing  unit  (an  array 
of  smart  pixels  with  digital  logic)  compares  the  contents  of  the  instruction  fields  to  a  specific 
value  to  determine  the  image  elements  on  which  the  instruction  will  be  executed  -  which  causes 
the  register  values  of  that  image  element  to  be  altered.  Finally,  the  processing  units  put  the 
addresses  of  the  next  instructions  in  the  instruction  fields  of  die  result  image,  which  is  then  sent 
back  to  the  parallel  memory  plane,  after  which  a  new  image  with  instructions  is  sent  to  the 
processing  units,  etc.  In  addition  to  the  original  computer  instruction  set,  data  transfer 
instructions  as  well  as  global  image  processing  functions  can  be  implemented  in  a  modular  and 
flexible  way. 


RtftrtHctt  :  [1]  P.  Herein  ans,  M.  Knijk,  R.  Voimckx,  O.  Borghs,  Appl.  Pkys.  Lett.  61.  1326  (1992) 

[2]  D.H.  Schaefer,  J.P.  Strong  IE.  Tie  computers',  Proc.  IEEE  63(1),  p.  129,  1977 
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1  Abstract 

An  optoelectronic  smart  pixel  array  for  incorporation  in  an  optical  neural  network  is  considered.  The  circuit, 
based  on  the  ART  memory  of  Tsay  and  Neweombe  comprises  photodetectors,  modulators  and  normalisation 
and  winner-take-all  circuitry. 


2  Summary 


There  has  recently  been  considerable  interest  in  the  optical  implementation  Ji  adaptive  resonance  theory 
(ART)  [1]  neural  networks  [2]  [3J.  Part  of  this  interest  arises  from  the  ART  matching  metric  being  an  inner 
product  between  the  input  pattern  and  stored  templates.  This  can  be  implemented  optically,  either  as  the 
seroth  lag  in  an  optical  correlator  or  as  a  vector  matrix  multiplier  system.  However,  the  details  of  the 
ART  algorithm  call  for  normalisation  steps  and  the  selection  of  maximum  outputs  in  order  to  decide  on 
the  best  match.  Conventionally  one  would  read  the  optical  signals  out  of  the  system  and  into  an  electronic 
computer  where  the  required  manipulations  would  be  performed.  This  leads  to  a  substantial  bottleneck  in 
the  processing.  Recently  Caudell  proposed  a  system  based  on  the  optical  vector  matrix  multiplier  which  uses 
smart  pixels  on  several  planes  in  order  to  obtain  the  required  functionality  [2].  Light  from  the  input  array 
illuminates  the  matrix  columns  and  this  light  is  collected  on  the  detector  array.  Depending  on  the  outcome 
of  this  calculation  the  stored  vectors  on  the  matrix  rows  may  be  either  updated  via  an  ANDing  operation 
with  the  input  vector  or  a  new  row  is  added.  Caudell  explains  the  operation  of  the  system  in  schematic 
terms  and  recently  we  have  implemented  the  matrix  array  using  liquid  crystal  on  silicon  (LCOS)  technology 
(4].  Each  pixel  of  this  two  dimensional  array  comprises  a  photodetector,  a  memory  dement  (flip-flop)  and  a 
modulating  dement  and  measures  75  jim  on  a  side. 

The  matrix  dement  is  thus  rdativdy  simple.  However,  the  output  array  has  considerably  more 
functionality  since  we  require  the  following  mathematical  operations. 


ti  ;  m 

Or  +  |T|  -  O  +  N 


(1) 


and 


TI 


|/| 


>  P 


(2) 


The  numerators  on  the  left  hand  side  of  both  these  equations  are,  of  course,  implemented  optically.  If  the 
normalizations  were  to  be  implemented  “directly"  we  would  need  analog  memory  and  division  circuitry.  A 
more  compact  way  however  is  to  adapt  the  circuitry  discussed  in  [5].  Consider  the  circuit  shown  in  figure  1 
where  the  current  I  which  flows  through  voltage  controlled  resistors  R1  and  R2  is 
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where  K,V,  and  K»V»  are  the  conductances  of  Rl  and  R2  respectively.  Then  the  voltage  V„f  measured  at 
the  junction  of  Rl  and  R2  is  (after  a  little  manipulation) 

V.i,  -  /  x  ^  ^  +  KtVk/Ka  ( 

Thus  if  we  can  make  the  template  moduli  proportion aJ  to  14  and  the  inner  products  between  the  inputs 
and  templates  proportional  to  V4  we  can  perform  calculations  of  the  form  required  for  equations  1  and  2.  A 
problem  however  is  the  storage  of  the  template  moduli  while  the  inner  products  are  being  calculated.  We 
get  round  this  problem  using  the  circuit  of  figure  2  at  each  pixel.  The  operation  of  this  circuit  is  as  follows. 
Light  first  falls  on  the  photodetectors  from  the  template  modulus  calculation,  generating  photocurrent.  14 
is  pulsed  high,  letting  the  gate  of  7\  charge  up.  The  final  voltage  on  Ti  is  determined  by  the  amount  of 
photocurrent  and  the  time  that  Vi  goes  high.  V}  now  goes  low,  trapping  charge  on  the  gate  of  7\.  Light 
now  falls  on  the  photodetector  from  the  inner  product  calculation.  If  the  voltage  V  on  transistor  T2  is  now 
set  to  a  small  value,  we  see  that  the  circuit  is  identical  to  that  of  figure  1  and  the  required  calculation  of 
equation  3  can  be  read  as  14 i$-  Once  the  signal  is  read  off  14  and  Vo  are  set  high  to  discharge  the  gate  of  T\ 
in  preparation  for  the  next  calculation. 

There  are  several  other  operations  that  need  to  be  implemented  for  the  ART  algorithm.  One  of  these 
is  the  selection  of  the  maximum  output  from  equation  1.  This  can  be  effected  using  a  winner-take-all  circuit, 
as  described  in  [0].  The  implementation  and  testing  of  the  circuitry  will  be  presented  at  the  conference. 
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Abstract:  A  novel  double-heterojunction  optical  PnpN  switch  is  presented  which  can  be  reset 
from  its  on-state  to  its  equilibrium  state  in  a  few  nanoseconds.  We  show  the  unique  potential  of 
this  device  as  fast  and  sensitive  optoelectronic  switch. 


Latching  devices  like  thyristors  are  ill  reputed  for  their  slow  turn-off.  In  a  PnpN  device,  the  reset 
operation  requires  the  removal  of  the  excess  free  carriers  in  the  center  n  and  p  layers.  For  device 
integration  in  arrays,  it  is  elegant  to  use  two-terminal  PnpN  devices,  and  perform  carrier  extraction 
by  applying  a  negative  anode-to-cathode  voltage  pulse  (VAr).  Furthermore,  it  is  also  important  to 
realize  that  carrier  extraction  on  its  own  is  not  sufficient:  for  the  switch  to  operate  uncritically  and 
with  optimum  sensitivity,  it  is  imperative  that  alt  free  carriers  of  the  center  layers  be  extracted.  No 
previously  reported  PnpN  device  allows  complete  carrier  extraction,  because  of  breakdown  of  one 
or  both  of  the  outer  junctions  under  negative  VA^_ 

The  key  to  PnpN  layer  structures  allowing  complete  carrier  extraction  was  revealed  by  numerical 
simulations  (Figure  1),  The  number  of  free  carriers  (per  unit  area)  in  the  center  n  and  p-layers 
during  turn-on  (between  0  to  2  ns)  and  turn-off  (from  2  to  10  ns)  are  shown  for  3  cases.  Case  A 
has  too  highly  doped  center  layers:  a  negative  pulse  of  -10  Volt  is  not  sufficient  for  complete 
carrier  removal.  Case  B  has  low  enough  doping  in  the  center  layers:  within  1.5  ns  all  earners  are 
extracted,  and  the  voltage  across  the  device  VAr  drops  to  the  applied  -10  V.  Case  C  shows  that 
without  negative  pulse,  carrier  decay  is  extremely  slow. 

Next,  we  have  fabricated  PnpN  switches  which  allow  complete  carrier  extraction.  Dopings  and 
layer  thicknesses  are  chosen  such  that  the  center  n  and  p  layers  are  already  completely  depleted  at 
equilibrium,  as  shown  in  Fig.  2a.  A  switch  can  be  reset  from  its  carrier-flooded  on-state  (Fig.  2b) 
to  the  zero-carrier  equilibrium  state  by  means  of  a  negative  pulse  lasting  less  than  10  ns  (detailed 
measurements  show  3  ns)  !  Figure  3  shows  the  static  I-V  characteristics  of  such  device.  Figure  4 
shows  that  after  reset  to  equilibrium  by  means  of  a  negative  pulse  beyond  -8  V  (region  C),  our 
PnpN  devices  can  be  pulsed  to  any  positive  voltage  smaller  than  the  static  breakover  voltage  (2.69 
V)  in  nanoseconds  without  unintentional  switch-on.  In  other  words,  our  thyristors  are  completely 
immune  to  dV/dt  triggering!  If  the  extraction  pulse  is  not  sufficient  to ‘reset  the  device  (less  than 
-8V),  dV/dt  immunity  is  not  complete  in  nanoseconds  (Fig  4),  but  necessitates  milliseconds 
(Figure  5).  This  is  typically  always  the  case  with  PnpN  devices  described  so  far  in  literature. 

Our  novel  PnpN  device  has  a  unique  potential  as  high-performance  optoelectronic  switch  (as  e.g. 
demonstrated  in  differential  configuration  in  [1]).  Amongst  all  PnpN  devices,  it  is  the  only  one  to 
permit  the  combination  of  high  speed  and  excellent  optical  sensitivity: 

High  speed  operation  is  possible.  The  reset  speed  is  intrinsically  limited  by  drift  and  diffusion  of 
the  carriers  being  extracted,  and  even  faster  reset  than  the  demonstrated  3  ns  is  expected  to  be 
possible.  After  reset,  the  thyristor  is  completely  immune  to  dV/dt  triggering.  It  can  therefore  be 
switched  to  any  voltage  close  to  the  break-over  voltage  in  nanoseconds  (instead  of  the  usual 
micro-  to  milliseconds)  without  unintentional  switch-on. 

Optical  Sensitivity  is  optimum:  Independently  of  the  detection  principle  used,  highly-sensitive 
light-reception  requires  to  have  a  well-defined  number  of  carriers  present  at  the  start  of  the 
incoming  light  pulse:  if  any  uncertainty  exists  on  this  value,  the  light  pulse  must  at  least  contain 
an  equivalent  number  of  photons  to  overcome  the  unknown  carrier  content.  Our  switch  provides 
ideal  and  most  reproducible  control  over  the  residual  carrier  density,  since  this  residue  is  zero. 


(1)  M.  Kuijk  et  al.,  “Optoelectronic  switch  operating  with  0.2  fJ/pm2  at  15  MHz”,  abstract  sent  in  to 
Optical  Computing  ’94,  Edinburgh,  August  1994. 


WP57/344 


WPe>8/345 


Resonant-tunneling  Triangular-barrier  Optoelectronic 
Switch  (R-TOPS)  for  Optical  Logic 

H.  Sakata,  K.  Utaka  and  Y.  Matsushima 
KDD  R&D  Laboratories 

2-1-15  Ohara,  Kamifukuoka-shi,  Saitama  356,  Japan 
Phone:+81-492-66-7831  FAX:+8 1-492-66-75 16 

Abstract  -  We  report  a  novel  optical  bistable  device,  which  consists  of  a  double¬ 
barrier  resonant-tunneling  diode  and  a  triangular-barrier  phototransistor.  Clear 
negative  differential  resistance  and  optical  bistability  with  high  contrast  and 
high  sensitivity  are  demonstrated. 

An  optical  bistable  device  is  a  v  ~  r  ~>otical  computing,  and  a  lot  of  devices 
have  been  studied  so  far  for  this  A  t  As  a  photodetecting  element  in  the 
devices  several  kinds  of  structures  such  as  a  conventional  heterostructure 
phototransistor  (HPT)  have  been  studied  [1].  As  another  candidate,  a  triangular- 
barrier  phototransistor  (TBP)  with  a  very  thin  gate  layer,  which  has  been 
developed  in  GaAs  /  AlGaAs  [2],  is  very  promising  from  a  view  point  of  high 
speed  and  high  sensitivity.  On  the  other  hand,  a  resonant-tunneling  diode  (RTD) 
has  a  potential  to  give  a  rise  to  unique  functions  attributed  to  its  negative 
differential  resistance  (NDR).  In  this  paper,  we  propose  a  novel  bistable  device, 
resonant-tunneling  triangular-barrier  optoelectronic  switch  (R-TOPS),  in  which 
a  TBP  and  a  double-barrier  RTD  (DB-RTD)  are  integrated. 

Figure  1  shows  a  band  diagram  of  the  device  fabricated.  It  was  composed  of 
InGaAs  /  InAlAs  for  1.5  pm  wavelength  range  operation.  The  TBP  had  n+-i- 
5p+-i-n+  structure,  and  in  the  DB-RTD  a  well  layer  and  barrier  layers  were 
InGaAs  and  AlAs,  respectively.  The  device  was  grown  on  an  n+-InP  substrate 
by  gas  source  MBE  and  was  etched  into  a  mesa  with  60  pm<t>.  Figure  2  shows  a 
schematic  load-line  graph  for  R-TOPS.  A  thick  curve  is  the  characteristic  of  the 
DB-RTD,  and  three  thin  curves  are  the  ones  of  the  TBP  at  different  input-light 
powers  PiJ*2  andP3,  which  show  exponential-like  dependence  of  current  upon 
voltage  due  to  thermionic  emission.  By  changing  the  input-light  power,  the 
intersection  varies  as  shown  in  Fig.2.  Experimental  I-V  characteristics  at 
different  input-light  powers  are  shown  in  Fig.3.  The  substrate  side  was  normally 
positively  biased  at  Vb,  and  the  wavelength  of  input-light  was  1.55  pm.  Clear 
NDR  with  bistability  was  observed  as  predicted  in  Fig.2  ,  and  a  peak-valley 
ratio  was  3.6 .  These  properties  show  that  both  TBP  and  DB-RTD  worked  well. 
The  bistability  in  the  relation  between  input-light  power  and  output-current  was 
obtained  as  shown  in  Fig.4.  It  operated  at  the  input-light  power  as  small  as  50 
pW  or  even  smaller,  and  the  output-current  change  was  about  10  mA,  which 
was  large  enough  to  drive  other  optical  devices  such  as  a  laser  diode.  Figure  5 
shows  the  optical  bistability  at  different  bias  voltages,  which  was  obtained  by 
connecting  the  device  and  a  laser  diode.  Gear  optical  bistability  and  sharp 
optical  NDR  property  with  high  contrast  and  low  input-light  power  was 
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observed.  An  intrinsic  optical  gain  was  estimated  to  be  larger  than  6  dB  from 
fiber  to  fiber.  These  characteristics  show  the  possibility  of  XOR  logic  operation 
as  well  as  optical  memory . 

In  conclusion,  we  fabricated  a  novel  R-TOPS,  and  clear  NDR  and  optical 
bistability  were  obtained.  The  device  is  expected  to  play  an  important  role  for 
optical  computing. 

[1]  Y.Kawamura  et  al.  IEEE  J.  Quantum  Electron.,  QE-28, 308  (1992) 

[2]  C.Y.Chen  et  al.,Appl.Phys.Lett.  39, 340  (1981) 
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Fig.l  Band  diagram 


Fig.3  I-V  characteristics  at  different 
input-light  powers 
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Fig.4  Output-current  vs.  input-light 
powers  at  different  bias  voltages 


Fig.5  Optical  bistable  characteristics 
at  different  bias  voltages 
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Abstract 


We  simulate  heat  transfer  in  hybrid  Silicon  IMplanted  OXide  optical  logic  gate*  end  link  it  to  their  static  end  dynamic 
behaviour.  We  compare  our  aimulstions  with  experimental  data  and  draw  conclusions  for  optimization.  The 
potentialities  of  the  device  in  view  of  its  compatibility  with  VLSI  circuitry  are  discussed. 

Summary 


In  this  paper  we  repent  on  the  progress  which  has  been  made  in  the  fabrication  and  use  of 
2-D  optical  logic  planes  consisting  of  silicon  Hybrid  Electrically  Assisted  Thermo-Optic 
Resonators  (HEATORs)  [1], 

The  HEATOR-structure  combines  a  pixellated  nonlinear  microresonator  with  the  properties 
of  a  pin-photodiode.  When  reverse  biasing  the  device  it  can  be  used  as  logic  gate  [2],  while  a 
forward  bias  turns  it  into  a  high-speed  spatial  light  modulator  [3].  Moreover  we  believe  it  is 
possible  to  extend  its  functionality  by  providing  both  active  intrapixel  and  interpixel  MSI,  LSI 
and  even  VLSI  circuitry,  thanks  to  the  unique  combination  of  both  the  hybrid  character  of  the 
device  and  its  silicon  nature.  The  present-day  devices  might  therefore  evolve  to  more 
sophisticated  smart  pixels.  The  ability  to  store  data  at  a  pixel  can  be  used  to  implement  memory 
functions  (although  this  might  be  done  exclusively  with  electronic  circuitry).  Furthermore, 
HEATORs  integrated  with  Si-photodetectors  or  HEATORS  used  as  photodetectors  can  lead  to 
on-chip  sensors.  In  combination  with  electronic  circuitry,  the  HEATORs  can  be  used  to 
perform  intrapixel  processing  with  low  level  processing  capabilities  such  as  signal 
amplification,  thresholding,  summation  and  logical  operations  that  can  be  performed  on  input 
and  stored  data.  Finally,  since  the  HEATORs  also  feature  SLM  properties,  they  can  be  used  to 
optically  output  data  and  to  provide  interpixel  communication  via  local  and  global  interconnects. 

In  the  past  we  developed  and  characterized  two  types  of  HEATORs:  the  first  generation  was 
fabricated  with  double-sided  polished  Si  substrates,  the  second  generation  used  silicon-on- 
sapphire  substrates  (SOS)  [2].  The  second  generation,  relying  on  an  epitaxially  grown  silicon 
layer,  overcomes  the  thickness  nonuniformity  of  the  first  type.  Moreover,  the  mesa-like  shape 
of  the  second  generation  resonators  prevents  lateral  heat  diffusion,  lowering  switching  power 
and  eliminating  interpixel  thermal  cross-talk.  The  second  generation  prototypes  display  optical 
bistability  at  much  lower  input  powers  than  the  Erst  generation,  but  still  need  mW-switching 
powers. 


We  now  have  fabricated  a  third  HEATOR  type  with  a  Silicon  Implanted  Oxide  (SIMOX) 
substrate  that  overcomes  the  second-generation  shortcomings  [3].  The  device  structure  that  is 
depicted  in  figure  1  is  obtained  by  implanting  an  oxide  layer  (D=0.43  pm)  in  a  silicon  substrate 
such  that  the  Si-Si02-Si  interface  acts  as  a  mirror  with  a  reflectivity  of  R=0.55.  Then  three  Si 
epilayers  flayer  1: 10000A  p+  1018-1019  cm  3,  layer  2: 45000A  non  doped,  layer  3: 50(X)A  n+ 
1018-1019  cm  3)  are  grown  and  we  etch  the  mesa  with  RIE  and  standard  lithographic 
techniques.  Finally,  a  reflective  dielectric  coating  (R=0.8)  is  deposited  on  top  of  tire  pixel  thus 
increasing  the  finesse  of  the  resonator  cavity  to  F=3.3.  If  we  etch  the  Si  substrate  right  down  to 
the  Si02  layer,  we  can  deposit  on  the  bottom  a  coating  with  a  reflective  coefficient  R-l,  and  on 
tire  top  side  a  coating  with  R*4).9.  In  that  case  tire  finesse  of  tire  resonator  would  be  F-12.2. 

An  efficient  operation  of  these  devices  requires  a  high  increase  of  the  resonator's  temperature 
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AT  when  illuminated.  In  order  to  gain  insight  into  the  temperature  distribution  within  this 
structure  and  to  optimize  the  temperature  rise  AT,  we  have  developed  a  simulation  program  in 
which  we  use  a  finite-difference  method  to  solve  the  partial  differential  equations  that  describe 
the  heat  transfer  in  the  device.  Besides  the  dynamical  build  up  of  the  temperature  in  the 
resonator  cavity  (figure  2)  die  program  also  calculates  at  any  instant  the  optical  output  of  the 
device  for  a  variable  optical  input  To  that  end  it  takes  into  account  all  the  characteristics  of  die 
resonator.  In  our  paper  we  show,  using  the  heat  transfer  model,  that  these  devices  can  display 
bistability  at  multi-microwatt  optical  Inks  power  and  that  when  adequately  dimensioned,  cross¬ 
talk  can  be  avoided  such  that  they  can  be  packed  into  dense  arrays.  Furthermore,  we 
demonstrate  that  an  appropriate  geometrical  design  of  the  structure  combined  with  proper  heat 
sinking  allows  microsecond  switching  times.  We  compare  experimental  data  with  the  results 
that  follow  from  the  numerical  modeling  and  show  that  the  scaling  down  of  the  resonator 
dimensions  considerably  decreases  the  required  switching  power.  We  investigate  on 
phenomena  such  as  critical  slowing  down  and  discuss  the  further  improvements  that  can  be 
made  to  these  devices  from  the  viewpoint  of  switching  times,  biasing  powers  and  switching 
energies.  We  will  show  with  experimental  data  that  the  use  of  thin  Si-resonators  allows  the 
operation  of  these  devices  from  400nm  to  1100  nm  leading  to  a  true  wavelength-flexible, 
spectral  linewidth  tolerant  operation.  Finally,  we  will  discuss  the  potentialities  of  this  device  as 
a  smart  pixel,  when  considering  its  integration  with  silicon  LSI,  MSI  and  VLSI  circuitry. 
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All-optical  logic  devices  are  imperative  for  the  development  of  optical  computing.  Although  semiconductor 
devices  using  resonant  nonlinearity  usually  exhibit  large  An,  their  large  linear  absorption  may  cause  low  device 
throughput  and  undesirable  thermal  effects.  Here  we  suggest  a  new  kind  of  devices  based  on  nonlinear  excited- 
state  absorption,  which  are  advantageous  for  their  low  linear  absorption,  test  response  and  minoriess  structure. 
Cw  and  some  metal-organic  materials  are  employed  to  demonstrate  the  principle  of  such  photonic  devices. 

A  six-level  model  for  organic  molecule  is  proposed  as  shown  in  Fig.  1 .  Photons  with  frequency  r  off  the  peak 
of  ground-state  absorption  are  simultaneously  absorbed  by  molecules  in  Sg,  S„  Sj  and  T,.  Populations  of  n^  n, , 
n,,  n,.  in  Sg,  S, ,  S,,  T„  and  the  photonic  flux  d  can  be  obtained  from  following  equations: 

(d/dt)n, =o#da0-eidurKi<rirK*rni  +Kjin2  (1) 

(d/3t)iij = (2) 

(d/dt)%=-u1dnr'KT0®T+Knoi  (3) 

N  “Uo+ni+Hj+nr  (4) 

(d/dz)d  -(0^v>-fe,n,  +ejnj+ 0!%)^  (5) 

where  Kg  are  transition  probabilities,  d-Uh'.  Assume  that  I=I_(z)fi(t),  here  I.(z)  is  the  peak  intensity  at  z,  f(t) 
is  temporal  function  of  the  pulse.  In  above  equations  t^  can  be  neglected  for  ns  laser  pulses  and  %  can  be 
neglected  for  ps  laser  pulses. 


■  u  u  at  an  mu 
Input  (I/cn  t) 


Fig.  1  Energy-level  structure  Fig.2  Transient  absorptive  optical  bistability 

Assume  that  a  laser  pulse  with  15ns  width  (Urge  than  mtersy stem-crossing  tune  constant)  at  532nm  are 
employed  as  the  light  source.  For  a  copper  phthalocyanine  solution,  we  simulated  the  characteristics  of  transient 
absorptive  optical  bistability  and  the  curve  of  output  I(t,L)  versus  input  l(t,0)  by  using  Eq.(l)-{4).  The  calculated 
results  are  consistent  with  experimental  date  as  illustrated  in  Fig. 2.(1] 


Input  Ploenoe(raJ/cin<) 


Fig. 3  Characteristic  curves  of  optical  limiters 


Fig.4a  Experimental  results  for  all-optical  switching 
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Simulation  of  optical  in  C.  solution,  and  transmission  T-Fu/F0  versus  input  flueoce  by  using 

Eq.(lH5),  an  also  consistent  with  experimental  data  (Fig.3)(2]. 

When  a  pulse  laser  with  ISas  at  532am  pumps  a  C»  solution  of  toluene,  a  large  population  can  be 
accumulated  in  T,.  If  in  the  same  time  a  CW  laser  diode  beam  at  747nm  (coincides  with  absorption  peak 
wavelength  of  excited  state  T,)  passes  through  the  sample,  photons  of  which  will  be  intensively  absorbed.  So 
that  the  output  intensity  switches  off.  This  all-optical  switching  process  is  shown  in  Fig. 4  (a).  The  simulation 
can  be  made  by  using  Eq.(lM4)  for  pump  beam  and  Eq.5  for  probe  beam,  as  shown  in  Fig. 4  (b)[2]. 

a 


Tina*  M 


Fig. 4b  Calculated  results  for  switching  Fig.5  Relative  transmission  versus  peak 

process  in  C  intensity  of  pumping  beam 

Fig.S  gives  the  simulated  and  experimental  results  of  optical  modulation  and  relative  transmission  of  probe 
beam  T/T0  versus  peak  pumping  intensity  1*.  A  design  for  'Exclusive  AND*  and  'Exclusive  OR*  gates  based 
on  this  characteristics  is  shown  in  Fig.6.[3]. 

When  using  8ps  pulses  at  532nm  to  act  on  a  cadmium  texaphyrin  solution  we  observed  that  reverse  saturable 
absorption  was  exhibited  only  at  low  fluencee.  At  high  flueoces,  however,  saturable  absorption  was  appeared, 
as  illustrated  in  Fig.  7.  The  reason  is  that  lifetimes  of  S,  and  Sj  in  cadmium  texaphyrin  are  comparable  to  laser 
pulse  width,  so  tij  can  not  be  neglected,  and  ff2<ff,[4].  The  simulated  results  of  T  versus  F0  by  using  (l)-(5) 
agree  well  with  experimental  data.  This  kind  of  curve  may  be  useful  for  optical  logic  operations  in  optical 
computing. 


Fig. 6  Principle  of  logic  operations  Fig. 7  Experimental  and  theoretical  results  for 

cadmium  texaphyrin 

Reference 

[1]  Chun-Fei  Li,  Lei  Zhang,  Miao  Yang,  Hui  Wang,  and  Yu-Xiao  Wang,  Physical  Review  A,  49,  (1994). 

[2]  Chun-Fei  Li,  Yu-Xiao  Wang,  Feng-Yun  Guo,  Rui-Bo  Wang,  ami  Lei  Zhang,  ACTA  PHYSICA  SINICA, 
42,  1236  (1993). 

[3]  Chun-Fei  Li,  Miao  Yang,  Feng-Yun  Guo,  Yu-Xiao  Wang,  and  Kunu  Tads,  international  Journal  of 
Nonlinear  Optical  Physics,  2,  (1993). 


[4]  Jin  Hai  Si,  Miao  Yang,  Yu-Xiao  Wang,  Lei  Zhang,  and  Chun-Fei  Li  to  be  published. 


WP6l/Vil 


Polychromatic  all-optical  gate  based  on  second-order  nonlinearity 

V.P.  Torchigin  and  A.E.  Kobyakov 

Scientific  Computer  Center,  Russian  Academy  of  Sciences,  Moscow,  Leninsky  pr.  32A, 

117334,  Russia,  tel  (095)-938-1893,  fax  (09 S)-9 38-6986,  e-mail  user@comcp.  msk. su 

A  possibility  of  using  known  quasi-phase-matched  second-harmonic  generators  as  the 
main  components  of  an  all-optical  gate  for  performing  logical  operations  on  WDM  signals  is 
considered.  The  main  parameters  of  the  gate  are  estimated. 

The  proposed  approach  enables  the  unique  possibility  of  development  of  polychromatic 
all-optical  gates  for  processing  wave-division  multiplexed  (WDM)  signals  so  that  the  overall 
rate  of  signal  processing  can  coincide  with  the  maximal  bit-rate  in  a  short  optical 
communication  line. 

A  schematic  diagram  of  the  proposed  gate  is  shown  in  Fig.  1 .  The  horizontal  nonlinear 
integrated-optics  lightguide  has  several  identical  periodical  domain  inverted  structures  (PDIS), 
which  are  usually  used  in  a  well-known  quasi-phase-matched  second-harmonic  generator 
(QPM  SHG)  [1].  Optical  power  enters  in  the  input  of  the  lightguide  in  the  form  of  a  periodical 
sequence  of  pulses  with  a  carrier  by  The  frequency  selective  directional  couplers  (DC)  are 
used  both  for  entering  the  logical  signals  A,  B,  and  C  with  carriers  o>s  (cas<ey)  and  for 
extraction  of  the  result  signals  X,  Y,  and  2  with  the  same  carriers.  The  cross-transmission 
coefficient  of  the  DC  is  close  to  one  for  the  signals  with  the  carrier  b>s  and  to  zero  for  the 
signals  with  the  carrier  by  In  such  a  device  the  pump  by,  signal  by,  and  idle  o>,  carriers  satisfy 
the  following  conditions  of  quasi-phase-matching. 

Ak=2it/A  (1),  by=b>s+b)j  (2),  where  Ak==(n(by)by-n(b)s)b)s-n(Q)i)o)i)/c  (3), 
n(b>)  is  an  effective  refractive  index  of  the  lightguide,  A  is  the  period  of  PDIS  in  the  lightguide, 
c  is  the  velocity  of  light  in  vacuum.  In  the  particular  case  when  b)s=tOj=oy/2  equation  (1) 
represents  the  QPM  condition  for  an  SHG. 

As  the  waves  with  the  carriers  by,  cos  and  arbitrary  phases  enter  the  PDIS  lightguide  the 
interaction  mentioned  manifests  itself  in  appearing  the  idle  wave  with  the  carrier  b)j  and 
increase  in  the  intensity  of  the  waves  with  the  carriers  b)s  and  b)j  and  in  decrease  in  the  intensity 
of  the  wave  with  the  carrier  by  [2].  In  this  case  the  PDIS  lightguide  is  used  as  an  optical 
parametric  traveling  wave  amplifier  (OPTWA).  The  intensity  of  the  wave  with  the  carrier  oy 
vanishes  on  some  distance.  The  length  of  the  OPTWA  is  chosen  equal  to  this  distance,  so  that 
there  is  no  wave  with  the  carrier  by  at  its  output.  This  situation  takes  place  if  a  small  control 
signal  with  the  carrier  b>$  enters  in  tne  input  of  the  OPTWA. 


Fig.  1  Schematic  diagram  of  the  gate  Fig.  2  Dependence  of  the  carriers  cos  and  cOj  of  signal 

and  idle  waves  cm  the  carrier  cy  of  pump  in  an 
OPTWA  if  the  QPM  condition  is  satisfied 
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The  signal  with  the  carrier  tos  is  present  at  the  output  X  only  when  the  control  signal  A 
with  the  carrier  to,  exists  at  the  input  A ,  i.e.,  the  output  signal  corresponds  to  the  input  one 
amplified  by  power.  Analogously,  the  signal  with  the  carrier  0)3  presents  at  the  output  Y  only  in 
the  absence  of  the  signal  A  and  presence  of  the  signal  B,  i.e.,  the  result  signal  at  the  output  Y 
corresponds  to  the  logical  function  2b.  Generally,  the  gate  can  have  N  logical  inputs  A,B,C,... 
for  control  signals  and  N  logical  outputs  for  result  signals  X,Y,Z,..  The  result  signals 
correspond  to  the  following  functions.  X=A,  Y=AB,  Z=JSC,  It  is  easily  seen  that  even  the 
gate  which  has  only  two  logical  outputs  is  boolean  complete. 

The  usual  well-known  DCs  can  be  used  for  physical  realization  of  the  gate.  Because 
localization  of  light  wave  field  in  a  cross-section  of  a  lightguide  decreases  on  lower  carriers, 
the  beating  length  L&  for  a  wave  with  the  carrier  cos  can  be  obtained  essentially  smaller  than 
for  a  wave  with  the  carrier  C0p.  The  DC  length  is  chosen  equal  to  L^2  so  that  the  cross 
transmission  coefficient  is  equal  to  one  for  the  wave  with  the  carrier  tos.  As  Zs«Zp,  for  a 
wave  with  the  carrier  co-  this  coefficient  is  close  to  zero. 

One  more  essential  advantage  of  the  gate  is  its  capability  of  simultaneous  processing 
several  sets  of  signals.  Actually,  QPM  condition  (1)  can  be  satisfied  at  the  same  time  for 
several  sets  of  signals  <0p  and  cos,  which  can  be  chosen  so  that  Ak  be  constant.  Therefore, 
several  signals  tosj  (j— 1,  2,  ....  N)  can  be  simultaneously  amplified.  In  other  words,  any  SHG 
with  ©s=C0p/2  for  which  equation  (1)  is  satisfied  can  be  used  as  an  OPTWA  with  other  values 
of  ODp  and  cos  for  which  (1)  is  also  valid. 

In  the  particular  case  when  n(o>)  is  a  linear  function  of  co,  dependence  of  Q)s  and  o,  on  ojp 
is  shown  in  Fig.2.  The  carrier  (Of  corresponds  to  the  fundamental  harmonic  when  an  OPTWA 
is  used  as  an  SHG,  for  cop>2(Of  each  value  of  (0p  corresponds  to  the  only  value  of  oos  and  vice 
versa.  One  should  note  that  for  pairs  (Opj  and  (0^  (j*k)  QPM  condition  is  not  satisfied  and  the 
average  effect  of  their  interaction  is  equal  to  zero. 

The  power  gain  of  the  gate  is  equal  approximately  to  that  of  an  OPTWA  and  can  exceed 
20-30  dB.  The  contrast  ratio  of  the  gate  is  larger  than  its  power  gain  by  the  value  of  losses  in 
the  PDIS  lightguide.  The  bit  rate  F\  is  determined  by  the  difference  in  group  velocities  of 
pulses  with  the  carriers  (Op  and  oas  in  an  OPTWA  and  by  its  bandwidth.  Assuming  that  the 
difference  in  velocities  is  equal  to  8%  [3]  and  the  bandwidth  of  an  OPTWA  having  the  length 
of  about  1  cm  is  -0.2  nm  [3],  we  obtain  F\=25  GHz.  The  maximal  value  of  JV  is  determined  by 
the  expression  N=Ft/F\,  where  Fl  is  the  total  bandwidth  of  an  OPTWA  for  various  pumps.  In 
this  case  N=  103.  The  pump  power  for  1  cm-length-OPTWA  is  equal  to  several  Watts,  the 
control  signal  power  is  20-30  dB  less. 

References 

1.  Arvidsson,  G  .  Laurell,  F.,  and  Webjom,  J.:  'Compact  light  sources  based  on  frequency 
conversion  in  lithium  niobate  and  other  nonlinear  optical  crystals',  Proc.  of  the  6-th  European  Conf.  on 
Integr  Optics  ESIO-93,  Apr.  18-22, 1993,  Neuchatel,  Switzerland,  pp.  3-23  -  3-25 

2.  Bloembergen,  N.:  'Nonlinear  optics'  (W.  A.  Benjamin,  Inc.  New  Y ork- Amsterdam,  1965) 

3.  Yamamoto,  K.,  Mizuuchi,  K.,  and  Taniuchi,  T.:  'Quasi-Phase-Matched  Second  Harmonic 
Generation  in  a  LiTaO  Waveguide',  IEEE  Joum.  of  Quant.  Electron.,  1992,  28,  9,  pp.  1909-1914 


WP62/353 


All-optical  Bistable  Devices  (ORION)  using  Laser  Diodes  coupled  to 
Interference  Filters  of  Narrow  Spectral  Bandwidth 

Yoshinobu  MAEDA 

Light  and  Material  Group,  PRESTO,  Research  Development  Corporation  of  Japan  (JRDC). 
Department  of  Information  and  Control  Engineering,  Toyota  Technological  Institute, 
2-12-1  Hi sakata,  Tempaku,  Nagoya  468,  Japan.  Phone:  81-52-802-1111 


Abstract:  All-optical  Instable  devices  were  demonstrated  by  using  die  longitudinal  mode 
hopping  of  die  laser  diode  and  the  narrow  transmission  spectrum  of  an  interference  filter. 
Optical  switch-on  and  -off  were  confirmed  by  directly  injecting  a  dye  laser. 

Optical  bistability  in  laser  diodes  (LDs)  is  a  most  interesting  subject  because  of  its 
many  advantages;  for  example,  such  diodes  have  optical  gain,  can  provide  large  fan-out 
and  can  have  operating  times  less  than  a  sub-nano  second.  Bistable  optical  devices  were 
confirmed  by  using  the  longitudinal  mode  hopping  of  die  laser  diode  and  the  narrow 
absorption  band  of  erbium  in  an  yttrium  aluminum  garnet  crystal  (ErYAG),  and  referred 
to  as  ORION  (Optical  logic  devices  using  die  Red  shift  of  a  laser  diode  and  Inversion  in 
Optical  absorbers  (or  filters)  of  a  Narrow  spectral  bandwidth). 1_3)  However,  when  using 
an  absorber  such  as  an  ErYAG  crystal  as  reported  earlier,  no  absorption  wavelength  can 
be  utilized  other  than  that  specific  to  the  absorber.  Furthermore,  there  was  a  disadvantage 
that  ceramics  like  YAG  hardly  conform  to  semiconductor.  This  paper  shows  that  the 
ORION  device  was  successfully  demonstrated  by  using  an  artificial  dielectric  interference 
filter  in  place  of  the  natural  absorber  such  as  the  ErYAG.  By  using  the  interference 
filters  in  this  study,  it  becomes  possible  to  use  an  arbitrary  wavelength  and  form  the  filter 
on  semiconductors  using  vacuum  evaporation  with  ease.  The  optical  switch-on  and  -off 
phenomena  were  observed  by  direct  injection  of  an  external  dye  laser  beam  into  the  LD. 

The  light  source  was  a  high  power  AlGaAs  laser  diode  operating  in  the  single 
transvase  mode.  The  threshold  current  is  50mA,  the  operating  current  at  optical  output 
of  30mW  is  100mA  and  the  slope  efficiency  is  0.6mW/mA.  Its  oscillation  wavelength 
was  kept  stable  by  adjusting  die  temperature  of  the  laser-head.  The  dielectric  interference 
filter  was  formed  by  vacuum  evaporating  1 1  layers  of  866A  of  Ti02  and  1335A  of  Si02 
alternately  on  a  glass  substrate  heated  to  300°C,  then  placing  2670A  of  Si02  as  a  spacer, 
and  adding  another  1 1  layers  of  Ti02  and  Si02  alternately  with  die  same  thicknesses. 
The  refractive  indices  of  Ti02  and  Si02  films  were  2.25  and  1.46,  respectively.  Optical 
waveforms  were  determined  with  a  digital  oscilloscope  with  a  bandwidth  of  1GHz  and  a 
photodetector  with  a  rise  time  of  90psec.  An  external  dye  laser  excited  with  a  nitrogen 
(N^  laser  was  injected  through  a  beam  splitter  into  the  LD.  The  dye  laser  produced  a 
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beam  with  a  central  wavelength  of  546nm,  a  spectral  width  of  2nm  and  a  pulse  width  of 
500psec. 

Figure  1  shows  relationship  after  transmission  through  the  interference  filter  between 
die  laser  power  and  the  injection  current  of  the  LD  in  continuous  operation  measured 
with  an  optical  power-meter.  It  was  observed  that  the  bistable  curve  fell  rapidly  at  82mA 
as  die  injection  current  increased,  and  in  contrast,  rose  rapidly  78mA  when  it  decreased. 

Figure  2(a)  exhibits  the  optical  intensity  variation  after  transmission  through  the  filter 
when  the  pulsed  dye  laser  in  Fig.2(b)  was  injected  into  the  LD  during  continuous 
oscillation  at  an  injection  current  of  79mA  (Point  A  in  Fig.l).  The  optical  switch-on  and 
-off  are  observed  when  die  trigger  light  pulse  of  Fig.2(b)  is  applied.  The  rise  and  fall 
time  are  around  lnsec  and  these  are  limited  by  the  bandwidth  of  the  oscilloscope. 

In  summary,  since  a  hysteresis  characteristic  exists  in  the  relationship  between  the 
wavelength  and  the  injection  current  of  the  LD,  an  optical  bistability  was  observed  in  this 
system.  In  addition,  the  optical  switch-on  and  -off  phenomena  were  observed  by  directly 
injecting  a  pulse  500psec  wide  into  an  external  dye  laser.  It  was  confirmed  that  the  device 
could  convert  changes  in  the  wavelength  of  the  LD  into  changes  in  intensity  transmitting 
it  through  the  filter. 
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Fig.l  Laser  intensity  after  transmission 
through  the  filter  versus  injection  currer' 
of  the  laser  diode  characteristics  sh' 
hysteresis. 


Fig. 2(a):  Optical  intensity  variation  after 
transmission  through  the  interference 
filter  by  applying  the  trigger  light  pulse 
into  the  laser  diode,  (b):  Trigger  light 
pulse  of  dye  laser. 
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Abstract:  The  carrier  densities  needed  to  saturate  excitons  in  strained  InxGaj.xAs 
/GaAlAs  quantum  wells  decreases  with  increasing  x  and  hence  with  increasing  strain.  A 
factor  of  10  reduction  for  x  =  0.11  compared  to  x  =  0  occurs.  Implications  for  nonlinear 
devices  are  discussed. ' 

The  viability  of  all-optical  computing  depends  critically  on  finding  a  material 
structure  with  strong  enough  nonlinearity  for  array  devices  along  with  an  optimised 
architecture.  Semiconductor  quantum  wells  have  shown  good  promise  due  to 
excitonic  enhancement  at  resonance  near  the  bandedge.  While  the  densities  of 
carriers  necessary  to  saturate  the  excitons  at  room  temperature  are  low,  there  is  still 
need  for  much  lower  densities.  In  this  paper,  we  show  that  almost  an  order  of 
magnitude  reduction  in  saturation  densities  can  be  achieved  by  introducing  strain 
into  the  quantum  well.  We  show  that  for  0.11  In  in  InGaAs/GaAlAs  fully  strained 
wells,  the  saturation  density  is  <  1  x  1017  cm*3  compared  to  ~  8  x  1017  cm*3  in 
unstrained  GaAs  wells.  Previously  only  factors  of  2  reduction  have  been  observed  in 
this  system  [1, 2]. 

Samples  of  InxGai-xAs/GaAlAs  quantum  wells  were  grown  by  MOVPE  with  x  =  0, 
0.03,  0.11  and  0.15,  the  latter  three  labelled  Si  to  S3,  respectively  in  Figure  1.  Ten 
periods  of  8  nm/8  nm  wells/barriers  were  grown  in  each  sample.  Samples  with  x  = 
0.03  and  0.11  were  fully  strained,  while  the  x  =  0.15  sample  was  partially  strain 
relieved,  as  shown  by  cross-hatching  under  a  Nomarskii  microscope.  A1  was  used  in 
the  barrier  to  increase  the  well  depths.  The  carrier  lifetimes  in  these  InGaAs  wells 
with  AlGaAs  barriers  have  been  shown  to  be  uniformly  short  [3],  about  0.5  nsec,  and 
independent  of  In  concentration  or  strain  relief.  We  attribute  the  effect  to  defects  in 
the  AlGaAs  barriers.  To  probe  the  effect  of  the  strain  on  the  nonlinearity,  we 
measure  and  calculate  the  saturation  carrier  den  .a y  instead  of  saturation  intensity, 
since  the  latter  also  includes  lifetime  effects.  To  do  this,  we  measure  the  absorption 
spectra  near  the  exciton  resonance  in  each  sample  as  a  function  of  input  optical 
intensity  I,  and  from  the  absorption  coefficient  a  (X,  I)  we  can  calculate  the  carrier 
density  N(X,  I)  =  a  (X,  I)  x  /E(X)  where  x  is  the  measured  carrier  lifetime 
independent  of  intensity  over  the  range  used,  and  E(X)  is  the  photon  excitation 
energy.  The  saturation  of  the  absorption  at  the  exciton  peak  follows  a  (I)  =  a  (o)  /  (1 
+  N/Nsat)  where  NSat  is  the  saturation  carrier  density.  This  equation  is  only  valid 
when  far  from  intensities  at  which  absorption  switches  over  to  gain.  Figure  1  shows 
a  plot  of  Nsat  against  In  concentration  for  the  four  samples  measured.  Nsat  for  x  =  0 
is  within  the  range  of  values  measured  by  other  groups  and  the  x  =  0.03  is  similar  to 
this.  A  large  decrease  of  a  factor  of  8  in  Nsat  occurs  between  x  =  0.03  and  0.11 
showing  that  strain  has  a  dramatic  effect  on  nonlinearity  as  expected  from 
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modification  of  the  valence  band  density  of  states  by  strain.  The  0.15  sample  is 
similar  to  the  0.11  sample  and  this  can  be  attributed  to  the  partial  strain  relief  of  this 
sample.  Also  shown  is  a  value  of  Nsat  derived  from  a  previous  experiment  [1]  on  an 
asymmetric  Fabry-Perot  modulator  with  a  large  number  (30)  of  x  =  0.1  In  wells.  The 
large  value  for  Nsat  is  explained  by  the  significant  measured  strain  relief  in  that 
sample.  The  value  of  Nsat  reported  in  Ref  2  for  a  sample  similar  to  S2  is  only  about  a 
factor  of  2  less  than  in  GaAs  wells  indicating  possible  strain  relief  in  that  sample  also. 

In  summary,  a  reduction  of  almost  a  factor  of  10  in  the  saturation  carrier  densities  for 
exdtons  in  InGaAs  by  strain  has  been  observed.  This  is  expected  from  the  reduced 
density  of  hole  states  at  the  valence  band  edge  by  strain  and  consequently  an 
enhancement  of  the  effect  of  phase  space  filling  for  exciton  saturation.  The  data 
indicate  that,  for  higher  In  concentrations  and  full  strain,  NSat  may  keep  on 
decreasing.  This  would  have  a  large  impact  on  devices  based  on  exciton  saturation 
and  increase  their  attractiveness  for  optical  computing  architectures.  To  avail  of  this 
large  effect,  however,  the  problem  of  short  carrier  lifetime  with  AlGaAs  barriers 
because  of  defects  will  need  to  be  resolved.  For  higher  In  concentrations,  on  the 
other  hand,  GaAs  barriers  may  be  used  without  significant  thermal  activation  of  the 
carriers  over  the  barrier  occurring,  and  hence  without  deleterious  effects  on  carrier 
lifetime  and  sample  heating. 
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Indium  Concentration 


Figure  l:Plot  of  saturation  carrier  density  against  Indium  concentration  x  for  x=0.03 
(Si),  0.11  (S2)  and  0.15  (S3). 


WP64/357 


Strong  Optical  Nonlinearities  of  Semiconductor  Quantum  Dots  and  Wires. 
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Strong  and  fast  optical  nonlinearities  were  detected  in  CdSe,  CdSSe 
nanocrystals  and  porous  silicon.  The  observed  phenomena  were  explained 
in  terms  of  filling  of  quantum  well  energy  levels  by  carriers  excited  by 
picosecond  laser  pulses. 


Nonlinear  optical  properties  of  semiconductor  nanocrystals  and  wires  of 
nanometer  scale  size  (quantum  dots  -  QD  and  quantum  wires  -  QWr)  are  of  significant 
interest  in  view  of  their  possible  applications  in  optical  computing  ,  as  well  as  from  the 
fundamental  point  of  view.  The  spatial  confinement  of  carriers  in  these  objects  results 
in  the  drastic  modification  of  the  energy  spectrum  (the  quasi-continuous  spectrum  of 
bulk  semiconductor  is  replaced  by  the  discrete  one)  and  supposedly  leads  to  the 
enhancement  of  optical  nonlinearities.  We  have  studied  nonlinear  optical  transmission 
spectra  of  CdSe,  CdSSe  nanocrystals  and  porous  silicon  wires  using  time-resolved 
pump-probe  technique.  Powerful  picosecond  (20  ps)  pulses  of  the  second  harmonic  of 
a  mode-locked  Nd-YAG  laser  excited  the  samples;  the  delayed  pulses  of  picosecond 
continuum  were  used  to  probe  the  transmission. 

The  investigated  samples  of  CdSe,  CdSSe  nanocrystals  embedded  in  glass 
matrix  (semiconductor  doped  glasses  -  SDG)  were  prepared  by  the  method  of 
secondary  heat  treatment  and  contained  crystallites  of  the  average  radius  R  less  than 
lOnm.  Free-standing  porous  silicon  layers  were  prepared  by  electrochemical  etching  of 
n-type  (111)  Si  wafers;  the  films  were  removed  from  the  substrate  by  applying  a  short 
electrical  pulse.  Atomic  force  microscopy  data  indicates  that  our  porous  Si  samples 
contain  thin  silicon  wires  of  3-5  nm  diameter.  The  absorption  edge  in  CdSe, CdSSe 
QD  and  Si  QWr  is  shifted  relatively  to  the  absorption  edge  of  the  corresponding  bulk 
semiconductor  due  to  the  quantum  size  effect. 

Photoexcitation  resulted  in  the  significant  bleaching  above  the  absorption  edge 
of  quantum  dots  and  wires.  Up  to  three  bleaching  bands  have  been  observed  in  the 
nonlinear  transmission  spectre  of  CdSe, CdSSe  nanocrystals.  We  could  resolve  the 
thermalization  of  carriers  (transitions  from  higher  energy  levels  to  the  lower  one  seen 
as  the  changes  of  the  relative  amplitude  of  the  bleaching  bands)  followed  by  the 
recombination  of  electrons  and  holes.  A  sufficient  cuttoff  of  the  intraband  relaxation 
time  compared  with  the  bulk  semiconductor  was  observed.  The  spectral  position  of 
bleaching  bands  and  their  kinetics  allowed  to  attribute  this  bleaching  to  the  filling  of 
energy  levels  of  spatially  confined  electrons  and  holes  by  photoexcited  carriers: 
saturation  of  1S-1S,  1P-1P  and  even  higher  transitions  was  clearly  seen.  It  is  important 
that  these  bands  could  not  be  resolved  in  the  linear  absorption  spectra  (probably 
because  of  the  strong  inhomogeneous  broadening  due  to  the  size  distribution  of 
nanocrystals). 
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The  obtained  values  of  10-MO10esu)  for  SDG  samples  are  much  greater 
than  for  "classical"  nonresonant  nonlinearities. 

The  bleaching  bands  corresponding  to  the  transitions  between  the  levels  of  size 
quantization  were  observed  in  the  nonlinear  transmission  spectra  of  porous  crystalline 
Si  wires  while  the  linear  absorption  spectra  had  no  peculiarities  dealt  with  space 
confinement  probably  because  of  strong  inhomogeneous  broadening  caused  by  size 
dispersion  of  wires. 

The  induced  decrease  of  absorption  was  explained  by  filling  of  the  size- 
quantized  energy  levels  with  nonequilibrium  carriers  (saturation  effect).  Instead  of  the 
indirect  gap  of  crystalline  bulk  silicon,  the  band  structure  of  the  wires  exhibits  a  direct 
gap  -  the  projections  of  four  valey  minima  of  bulk  silicon  oriented  along  (100)  and 
(010)  directions  onto  the  wire  axis  are  at  the  zone  center.  The  energies  of  two 
bleaching  bands  correspond  to  that  of  size-quantized  wire  with  about  3x3  nm2  cross 
section  (the  lowest  transitions  Ehu,  E'n  for  electrons  with  effective  mass  in  (110)  and 
(T 10)  confinement  directions  and  heavy  and  light  holes).  Probably  only  part  of  the 
wires  with  proper  cross  section  (proper  energy  spectra)  were  resonantly  excited  by  the 
laser  beam.  The  E‘u-band  could  be  seen  only  at  zero  delay  of  the  probe  pulse;  the 
relaxation  time  of  Eh,,-band  for  porous  Si  (300  K)  was  about  40  ps. 

The  values  HCHesu)  and  dispersion  of  imaginary  part  of  third  order  nonlinear 
susceptibility^^  of  porous  Si  were  obtained. 

The  observed  strong  and  fast  nonlinearities  in  Si  quantum  wires  and  CdSe, 
CdSSe  quantum  dots  make  possible  future  applications  of  these  materials  for  high 
speed  optical  switching. 
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Abstract.  Carrier  heating  induced  suppression  of  band  filling  has  been  shown  as  an  effective 
mechanism  of  optical  nonlinearity  in  bulk  and  quantum  well  III-V  semiconductor  structures,  which  leads  to 
both  absorption  and  dispersion  types  of  optical  bistability  and  all-optical  switching  in  a  picosecond  time-scale. 

Photonic  switching  in  III-V  heterostructures  in  the  spectral  region  near  the  fundamental 
absorption  edge  attracts  much  attention  because  of  its  great  capability  for  optical  computing  and 
information  processing  [1].  An  important  line  of  activity  in  the  field  is  a  search  for  novel 
physical  mechanisms  of  optical  nonlinearity  (ON),  those  can  improve  characteristics  of  related 
digital  systems,  in  particular  their  speed.  Here  we  discuss  the  carrier  heating  induced 
suppression  of  band  filling  as  such  a  mechanism  in  both  bulk  semiconductors  (BS)  and 
quantum-well  structures  (QWS),  that  leads  to  all-optical  switching  in  a  picosecond  time-scale. 

The  nature  and  the  main  features  of  this  ON  may  be  explained  as  follows.  Consider 
degenerate  n-type  BS  or  QWS  with  thermalized  3D  or  2D  electron  gas  and  Fermi  quasilevel  <t> 
well  above  the  conduction  band  edge  or  bottom  of  the  ground  subband,  respectively.  The 
optical  absorption  is  determined  by  both  intraband  and  interband  radiative  transitions.  For 
photon  energy  too,  related  to  direct  interband  transition  into  the  state  a  little  bit  below  Fermi 
quasilevel,  exciton  effects  play  no  role  and  the  absorption  coefficient  Am  may  be  written  as: 


Aa  =  a»oN+  aJ1  ~ 

Here  am  is  the  cross-section  (per  one  electron)  of  intraband  absorption,  N  is  the  carrier 
concentration,  is  the  interband  absorption  coefficient  when  degeneracy  is  lifted,  /  is  the 

Fermi-Dirac  distribution  function,  em=  (p/mXhto  -  eg) ,  where  p  and  m  are  the  reduced  mass 
of  electron-hole  pair  and  effective  mass  of  electron,  respectively,  eg  is  the  energy  bandgap.  In  a 
case  of  QWS  Aa ,  N ,  and  am  all  are  thought  as  being  averaged  over  the  period  of  a  structure, 
while  eg  -  as  the  energy  gap  between  extrema  of  ground  electron  and  hole  subbands.  Usually 
aa*  aaF  so  interband  transitions  give  the  main  contribution  if  /  -+  0,  i.e.  in  the  absence 
of  degeneracy.  Otherwise,  if  Fermi  quasilevel  #  satisfies  inequalities: 

*-  tm>Tln(~^)  >T, 

where  T  is  the  effective  temperature  of  carriers,  f(tj  -*  l,  i.e.  interband  absorption  gets 
weakened  by  band  filling,  and  Am  -  amN.  However,  intraband  transitions  lead  to  generation 
of  energetic  carriers  and  under  the  thermalization  conditions,  those  are  typical  for  a  high-density 
degenerate  electron  gas,  they  give  rise  to  increase  in  temperature  of  a  Fermi  sea  as  a  whole.  The 
larger  intensity  of  light,  the  stronger  carrier  heating  and  when  T>  -  ea  the  ultimate  case  of 
f(ej  -*  0.5  and  Am  -  0.5an  is  achieved.  Thus,  we  have  an  ON  with  increasing  absorption 
caused  by  carrier  heating  suppression  of  band  filling  and  characterized  by  a  nearly  step- 
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function  dependence  of  absorption  coefficient  Am  on  light  intensity  lm .  Its  value  needed  for 
switching  from  weak  (intraband)  to  strong  (interband)  absorption  may  be  estimated  as: 


where  T0  and  xT  are  the  dark  temperature  and  characteristic  cooling  time  of  carriers,  f  is 

numerical  parameter,  different  in  cases  of  BS  and  QWS .  The  last  one  is  more  attractive  from  the 
power  point  of  view,  because  of  quantum  confinement  makes  possible  the  direct  intraband 
transitions  (from  ground  to  continuum  state,  for  example)  and  so  enhances  the  free-carrier 

absorption.  For  a  typical  GalnAsP/InP  QWS  parameters  (aa~  l(Tlscm2,  rr-  3ps,  N- 3»10 
17 cm'3,  and  an-  104 cm'1)  we  find  that  at  room  temperature  switching  from  weak  to  strong 

absorption  occurs  at/^  sJ06Wicm2.  Even  this  value,  related  to  full  eliminating  of  band  filling 
effect,  seems  to  be  not  an  extremely  large,  but  really  only  the  partial  suppression  of  band  filling 
is  needed  for  optical  switching.  The  speed  of  switching  from  weak  to  strong  absorption  is 
determined  by  thermalisation  of  energetic  carriers,  while  the  speed  of  a  back  switching  is 
limited  by  die  rate  of  cooling  of  a  Fermi  sea.  Under  the  actual  conditions  both  processes  lead  to 
a  time  delay  in  a  few  picoseconds  only  and  so  described  mechanism  of  ON  allows  to  improve 
speed  of  switching  compared  to  any  ones  involving  bipolar  generation  and  accompanying  slow 
recombination  processes  [1]. 

To  justify  the  adduced  qualitative  scenario  we  performed  the  detailed  theoretical  study 
and  numerical  simulation  of  carrier  heating  induced  ON  and  optical  bistability  (OB)  in  doped 
GalnAsP/InP  BS  and  QWS.  In  a  case  of  BS  the  impurity-assistant  indirect  transitions  are 
assumed  as  being  responsible  for  intraband  absorption.  They  are  described  under  the  second 
order  of  perturbation  theory,  and  arbitrary  degeneracy  as  well  as  plasma  screening  are  taken 
into  account.  In  a  case  of  QWS  the  direct  radiative  transitions  from  ground  subband  to 
continuum  are  considered  as  a  reason  for  intraband  absorption.  For  their  description  the 
convenient  envelope  function  approximation  is  employed,  but  spatial  variations  of  a 
longitudinal  effective  mass  and  quantum-size  effects  in  tire  continuum  states  also  are  involved 
into  consideration.  The  results  of  numerical  calculations  are  presented  in  the  form  of  photon 
energy  and  carrier  temperature  dependencies  of  material  absorption  coefficient  and  refractive 
index  (determined  trough  the  Kramers- Kronig  transformation).  They  show  that  the  main 
features  of  temperature  ON  in  degenerate  narrow-gap  semiconductors  as  n- type  GalnAsP/InP 
(both  BS  and  QWS)  -  the  red-shift  of  fundamental  absorption  edge  and  the  nearly  step- 
function  variation  of  Am  (T)  -  are  analogous  (with  interchange  of  T  and  N)  to  the  features  of 
well-known  concentration  ON  in  undoped  wide-gap  semiconductors  of  the  type  CdS  [2].  At 
the  same  time,  carrier  temperature  induced  ON  is  faster  than  carrier  concentration  induced  ON 
in  the  recombination  to  cooling  times  ratio,  i.e.  approximately  in  three  orders  of  magnitude. 

Carrier  heating  induced  OB  in  BS  and  QWS  is  studied  in  vertical  and  waveguide 
geometry,  respectively  (the  radiation  in  a  waveguide  is  assumed  TM  polarized).  So  far  as 
generation  of  nonequilibrium  carriers  is  unimportant  under  the  actual  conditions  and  both 
absorption  coefficient  and  propagation  constant  are  determined  by  carrier  temperature  only,  die 
self-consistent  treatment  of  a  nonlinear  light-carrier  interaction  is  based  on  the  coupled  set  of 
electron  thermal  conductivity  and  light  propagation  equations.  Also  die  nonequilibrium  phonon 
effects  are  taken  into  account  as  leading  to  a  lag  in  the  energy  relaxation  of  a  dense  electron  gas 
and  so  having  substantial  influence  on  carrier  temperature.  Then,  two  ultimate  cases  in  the 
conditions  for  appearance  of  OB  are  distinguished,  those  relate  to  resonatorless  OB  due  to 
increasing  absorption  and  dispersion  type  OB  in  nonlinear  asymmetric  resonator.  The 
numerical  simulation  is  carried  out  for  each  of  which,  that  shows  die  occurrence  of  hysteresis  in 
the  power  transmission  characteristics  and  the  possibility  of  all-optical  switching  in  a 
picosecond  time-scale.  It  is  found,  that  the  lowest  required  intensity  of  the  incident  light 
corresponds  to  the  case  of  dispersion  type  OB  in  a  QWS  asymmetric  waveguide  resonator. 

References.  1.  S.W.  Koch,  N.  Peghambarian,  and  H.M.  Gibbs.  /.  Appi.  Pkys.,  fi  Ri  (1988). 

2.  S.W.  Koch,  H.E.  Schmidt,  and  H.  Hang.  Appi.  Pkys.  932  (1984). 
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Aib§ttast; 

Diffraction  kinetics  of  transient  gratings  in  GaAs  epilayers  and  GaAs/GaAlAs 
multi-quantum  wells  at  room  temperature  are  reported.  Samples  are  prepared  by 
lift-off  epitaxy  The  results  are  interpreted  in  terms  of  time  separation  of  the 
electronic  ami  thermal  contributions 

SUMMARY 

The  aim  of  this  work  was  to  study  the  subnanosecond  diffraction  capabilities 
of  GaAs  epilayers  and  GaAs/GaAlAs  multi-quantum  wells  at  room  temperature. 
The  epitaxial  structures  were  grafted  onto  glass  slides  by  means  of  the  lift-off 
epitaxy  technique.  Diffraction  kinetics  of  transient  gratings  were  measured  using 
the  first  order  diffraction  in  the  Raman-Nath  configuration. 

Experimental  set-up:  Pulses  were  generated  directly  at  the  wavelength  1.064  Jim 
by  an  actively  mode-locked  Nd:YAG  laser.  Pulses  at  the  wavelength  0.532  pm 
were  generated  by  frequency  doubling  in  KDP  crystals.  The  duration  of  the  pulses 
was  approximately  30  ps.  These  pulses  synchronously  pump  a  dye  laser  operating 
in  the  range  (825-850  nm)  The  duration  of  the  pulses  was  approximately  15-20 
ps.Two  equal  energy  pump  pulses  at  the  wavelength  0.532  pm  (PI  and  P2) 
interacted  inside  the  sample  to  produce  transient  gratings.  The  external  angle 
between  these  two  pump  beams  was  26  =0.1  radian,  corresponding  to  a  grating 
period  A  of  10  pm.  The  grating  was  then  read  by  a  probe  pulse  (T)  generated  by 
the  dye  laser  and  directed  in  normal  incidence  to  the  sample  surface  The  probe 
pulse  was  delayed  with  respect  to  the  pump  pulses  PI  and  P2.  The  pump  and 
probe  beam  sections  were  limited  by  a  circular  diaphragm  stuck  on  the  sample.  The 
diameter  of  the  hole  was  1mm.  The  pump  pulses  energy  was  varied  in  the  range  1- 
10  mJ/cm2  .  The  probe  pulse  energy  was  lower  than  0. 1  mJ/cm2.  We  measured  the 
energy  of  the  first  order  diffracted  pulse  versus  the  probe  delay.  A  diffraction 
kinetics  is  reported  in  Fig.  1 . 

Samples  preparation: 

Active  multi-quantum  well  epitaxial  structures  on  transparent  substrates  wore 
achieved  in  two  steps.  First,  the  structures  were  grown  on  GaAs  substrates  by 
molecular  beam  epitaxy.  They  consist  of  80  periods  of  alternating  GaAs  wells  and 
(Ga,Al)As  barriers,  whose  thicknesses  are  9.6  run  and  13  nm  respectively.  Second, 
the  epitaxial  structures  were  removed  from  their  substrates  and  grafted  onto  glass 
slides  by  means  of  the  lift-off  epitaxy  technique. 
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Fig.  1:  Diffraction  kinetics  in  the  first  order  Raman-Nath  configuration 
measured  on  multi-quantum  wells  GaAs/GaAlAs.  The  pump  pulse  energy 
for  PI  and  P2  are  respectively  4  mJ/cm2  and  2  mJ/cm2  . 


Experimental  results: 

The  diffraction  kinetics  exhibit  a  peak  during  the  first  200  ps  and  then  a 
minimum  within  the  interval  200-300  ps.  The  diffracted  pulse  is  very  unstable  in 
this  late  interval  due  to  the  competition  between  several  opposite  processes  (each 
reported  data  point  has  been  averaged  over  20  pulses).  After  300  ps  the  energy  of 
the  diffracted  pulse  rises  up  to  a  stable  value. 

Interpretation: 

The  peak  in  the  curve  of  Fig.  1  is  due  to  an  electronic  process.  The  presence 
of  the  electron-hole  plasma  induces  a  decrease  of  the  optical  index  in  the 
illuminated  fringes.  For  long  delays,  the  thermal  contribution  is  dominant  and 
produces  an  increase  of  the  optical  index  in  the  illuminated  fringes.  The  interval 
(200-300  ps)  corresponds  to  the  change  of  sign  of  the  optical  index.  A  model  will 
be  presented  to  support  this  interpretation. 

In  present  work,  differ ents  wavelengths  for  the  pump  pulses  are  used  in 
order  to  vary  the  penetration  depth  of  the  pump. 

These  results  are  only  a  part  of  a  more  general  study  devdopped  in  our  laboratory 
in  order  to  use  GaAs  epiiayers  and  GaAs/GaAlAs  multi-quantum  wdls  for  all- 
optical  interconnects. 
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Abstract .  Optical  nonlinearity  in  the  vicinity  of  fundaaental  absorption 
edge  in  heavily-doped  n-GaAs  and  n-InP  has  been  experinentally  investigat¬ 
ed  in  details.  The  theoretical  analysis  concerning  the  possible  nechanisns 
of  nonlinearity  observed  is  presented. 

By  an  earlier  date  it  was  reported  [1-3]  that  heavily-doped 

semiconductors  A3B5  exhibited  high  potentiality  for  use  in  bistable 

Fabry-Perot  switching  devices.  The  nonlinear  optical  properties  of  such 

semiconductors  need  to  be  understood  in  details.  In  this  paper  we  report 

the  results  of  our  further  experimental  investigations  and  some 

theoretical  considerations  of  the  optical  nonlinearity  in  heavily-doped 

bulk  samples  of  GaAs  and  InP  (the  concentrations  of  shallow  donors 
18  -  3 

n«10  cm  )  in  the  vicinity  of  fundamental  absorption  edge.  This  spectral 
region  is  of  particular  interest  because  of  the  maximum  of  the  nonlinear 
response  observed.  We  have  used  both  the  technique  of  saturation 
absorption  (to  determine  nonlinear  absorption  changes)  and  self- 
diffraction  experiments  (to  determine  refractive  index  changes).  The 
experiments  have  been  carried  out  by  using  8  ns  laser  pulses,  generated  by 
a  tunable  color-center  laser.  All  of  the  experiments  were  conducted  at 
room  temperatures. 

1.  The  transmission  spectra  of  the  semiconductor  samples  at  incident 

3  6  2 

light  power  varied  from  10  to  10  W/cm  were  measured.  The  effect  of 
optical  bleaching  below  the  band  edge  has  been  observed.  This  effect  is 
much  greater  in  samples  with  high  concentration  of  impurity  (nalO18  cm3) 
as  compared  with  medium  and  slightly  doped  ones.  The  nonlinear  response 
in  absorption  decreases  with  the  increasing  of  a  wavelength.  For  instance, 
in  n-InP:Sn  sample  (n«1018cm*3)  the  saturation  intensity  increases  from 
0.17  MW/cm2  at  wavelength  935  nm  to  0.88  MW/cm2  at  955  nm).  To  the 
contrast,  in  semi-insulating  samples  the  increase  of  absorption  has  been 
observed  in  f he  same  spectral  region. 

ii.  The  self -diffract  ion  measurements  have  been  carried  out  at  the 
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conditions  of  the  Raman-Nath  diffraction.  In  most  experiments  the 
multy-order  diffraction  (up  to  the  fourth  diffraction  order)  took  place. 
The  spectral  and  energy  dependencies  for  the  1st,  2d  and  3d  diffraction 
orders  were  aeasured  for  a  number  of  doped  sealconductor  samples.  It  has 
been  found  that  the  self-diffraction  efficiency  reaches  the  aaxiaua  at 
wavelengths  slightly  below  the  band  edge.  Our  estimates  give  the  value  of 
nonlinear  refraction  index  change  of  about  0.002  at  incident  power  of  200 
KW/ca 2  for  InP:Sn  (n*1018cm-3) . 

lii.  Theoretical  analysis  of  possible  mechanisms  that  are 
responsible  for  the  nonlinear  optical  absorption  observed  is  presented. 
The  nonlinearity  considered  is  assumed  to  involve  the  following  effects: 
band  filling,  band  gap  shrinkage  and  screening  of  randomly  distributed 
impurity  potential,  as  a  result  of  optically  generated  electron-hole 
plasma.  It  is  supposed  that  the  nonlinear  optical  absorption  can  be 
analyzed  on  the  basis  of  the  theory  of  linear  interband  optical  absorption 
in  case  of  disordered  potential  [4],  by  taking  into  account  the  dependen¬ 
cies  of  semiconductor  parameters,  such  as  band  gap  energy,  electron  and 
hole  Fermi  quasi-levels  and  screening  length,  on  the  density  of  photo- 
generated  carriers.  This  method  enables  us  to  consider  the  absorption  edge 
broadening  caused  by  both  the  Franz-Keldysh  effect  in  the  internal  field 
of  randomly  distributed  ionized  impurity  atoms  and  inhomogeneous  band  gap 
shrinkage.  In  Numerical  simulation  of  the  process  of  a  nonlinear 
interaction  of  laser  pulses  with  semiconductor  samples  we  use  preliminary 
calculated  dependencies  of  the  absorption  coefficient  on  the  concentration 
of  the  photoinduced  plasma.  The  comparison  of  calculated  and  experimental¬ 
ly  measured  spectral  and  energy  dependencies  of  transmission  allows  to 
clarify  the  contribution  of  different  mechanisms  in  nonlinearity  observed. 
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Abstract 

Two  key  components  for  future  photonic  networks  using  wavelength 
multiplexing  are  designed  using  commercially  available  popular  technologies. 
These  components  handle  wavelength  conversion  and  filtering.  Design  parameters 
and  design  results  are  introduced. 

1 .  Introduction 

The  first  step  towards  a  photonic  switching  network  will  probably  be  the 
introduction  of  frequency  multiplexing  into  a  subscriber  access  fiber  line.  Most 
research  level  devices  are  not  adequate  yet  because  they  are  too  expensive,  not 
reliable  enough  and  hard  to  find.  This  paper  describes  two  key  components  for 
wavelength  conversion  and  filtering  designed  using  currently  available 
technologies.  The  components  are  experimental  products,  therefore  it  is  assumed 
that  further  improvements  will  be  needed  by  having  evaluation  result  such  as 
applying  new  device  technologies.  The  design  specifications  were  : 

(a)  Wavelength  band  :  1.1  -  1.6  pm 

(b)  Number  of  multiplexed  waves  :  eight 

(c)  Minimum  Wavelength  spacing  :  20  nm 

(d)  Fiber  mode  :  Multi-mode  (50  pm  core  diameter) 

(e)  Maximum  signal  speed  (design  target) :  622  Mbps  ( >lGHz) 

(f)  Transmission  distance  (design  target) :  5  -  10  km 

Selected  LD  (Laser  Diode)  wavelengths  were  1175-1190,  1210-1225,  1275- 
1285,  1305-1310,  1330-1340,  1505-1525,  1545-1550, 1570-1580pm  . 

2.  Wavelength  Conversion  Component 

This  converts  a  signal  of  one  wavelength  to  one  of  another  wavelength, 
under  instructions  from  wavelength  selection  port.  The  incoming  signal  is  detected 
by  a  photodiode,  the  electronic  signal  is  amplified,  a  LD  driver  activates  an 
appropriate  LD  out  of  eight,  and  the  output  light  is  guided  to  the  output  fiber 
through  a  coupling  device. 

(a)  Input  signal  level :  >  -30  dBm  (using  an  InGaAs-APD) 

(b)  Amplifier  gain  :  30  dB 

(c)  Modulation  transparency  :  digital  /  analogue  (by  bias  selection) 
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(d)  UD  output  signal  level  :  0  dBm  (using  a  commercially  available  Fabry  Perot 
resonator  type  LD  bare  chip) 

(e)  Fiber  output  signal  level :  -10  dBm 

(0  Signal  selection  port :  TTL  level  (selecting  one  out  of  eight ) 

(g)  Size  :  120x66x40  mm  (excluding  connectors  and  pigtails) 

(h)  Power :  +5,  -5,  +80  volts 

(i)  Others  :  power  feed  connector,  channel  selection  monitor  LED,  input  signal 
monitor  LED. 

3.  Filtering  Component 

This  receives  a  signal  with  eight  wavelengths  multiplexed  on  it,  and 
separates  out  and  guides  the  signal  at  each  wavelength  to  an  output  port  for  that 
wavelength.  It  has  no  active  device.  The  input  light  signal  from  the  fiber  is 
converted  into  concurrent  beams  by  several  lenses.  Each  time  the  beam  arrives  at 
a  filter,  it  is  divided  into  signals  of  two  different  wavelengths,  one  passes  through 
to  the  output  port  and  the  others  is  reflected  to  the  next  filter.  The  output  ports  have 
a  lens  circuit  which  guides  the  light  to  an  output  fiber. 

(a)  Number  of  new  lens  designs  :  eight 

(b)  Filter  fabrication  :  dielectric  multi-coated  filter 

(c)  Number  of  filter  types  :  16 

(d)  Insertion  loss  :  less  than  10  dB  (design  target) 

(e)  Size  :  26x26x18  mm  (excludes  connectors  and  pigtails) 

4.  Design  Evaluation 

The  performance  was  measured  and  results  will  be  introduced  at  the 
conference.  More  than  forty  of  each  component  were  produced.  Their  cost  was 
reasonable. 

Eight  wavelengths  is  an  adequate  number  at  presents.  If  the  number 
increases,  it  will  be  difficult  to  purchase  commercially  available  LD ,  and  the  loss 
of  the  coupling  device  will  also  increase. 

There  is  a  technical  bottleneck  at  connection  between  LD  and  coupling 
device.  Variations  in  LD  alignment  cause  a  loss  of  more  than  10  dB.  A  simple 
solution  to  this  problem  is  required. 
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ABSTRACT 

We  present  a  new  multichannel  heterodyne  optical  communication  line  based  on  a 
family  of  high-stable  monolithic  neodymium  mini-lasers  pumped  by  the  single  laser. 

Introduction 

Diode  lasers  emission  spectra  demonstrate  far  from  full  accordance  with  neodymium  ions 
absorption  spectra.  Moreover,  strong  temperature  dependence  of  the  form  and  width  of 
emission  spectrum  of  the  semiconductor  laser  (AX/ AT  ~  0.3nm/K)  makes  it  necessary  to  use 
thermostabilizing  devices,  which  complicates  the  design  of  laser  and  decreases  total  efficiency. 
For  pumping  with  diode  arrays  the  additional  aspects  of  the  variation  in  the  central  wavelength 
and  bandwidth  from  stripe  to  stripe  must  be  considered.  Variation  of  neodimium  absorption 
line  center  wavelegth  in  various  laser  materiales  complicate  simultaneousl  pamping  of  a  number 
of  different  lasers  using  only  one  laser V 

Recently  we  demonstrated^  the  possibility  to  pump  via  Cr^+  a  family  of  various 
neodymium  doped  laser  hosts  using  the  only  one  laser.  A  set  of  such  mini-lasers  is  useful  in 
multichannel  optical  communication  lines.  The  comparative  performance  of  a  set  of  laser- 
pumped  mini-lasers  based  on  Nd^+,  Cr^rREG  (Rare-Earth  Garnets)  are  presented.  The 
prototype  of  a  new  multichannel  communication  line  with  information  transfer  rate  estimated  to 
be  of  1000  Gbit/sec  is  described. 

Experimental  results 

Different  semiconductor  AlGaAs  lasers  or  argon  laser  were  used  pump  sources  of 
different  Nd,  Cr-codoped  crystals:  GGG  (  X  =  1062.1  nm),  GSGG  (  X  =  1061.2  nm), 
YSGG  (  X  =  1058.5  nm),  GSAG  (  X  =  1060.2  nm)  and  several  garnet  mixtures 
(GSGG/GSAG  and  YSGG/GSAG  type)3. 

We  have  tested  the  following  active  elements:  GGG:Cr,Nd  ;  GSGG:Cr,Nd; 
YSGG:Cr,Nd;  GSAG:Cr,Nd.  Cr3+  and  Nd3+  ions  concentration  in  the  crystals  can  be 
varied  from  1x10^0  to  5xl020cm"3  depending  on  the  absorption  efficiency  of  radiation 
pumping. 

A  set  of  four  linear  monolithic  mini-lasers  united  in  one  block  and  simultaneously  or 
separately  pumped  by  the  only  one  argon  laser  was  created.  The  frequency  shift  (AFl) 
between  different  lasers  was  ~  100  GHz.  On  the  base  of  this  block  of  the  lasers  the  model  of 
optical-fiber  communication  line  was  developed.  The  scheme  of  such  optical-fiber 
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communication  line  is  shown  in  Fig.  The  emission  of  four  lasers  is  independently  modulated  by 
high-frequency  modulators. 

The  modulated  emission  was  inserted  through  the  optical  connectors  in  one  optical  fiber. 
The  radiation  emitting  from  optical  fib  distant  end  is  mixed  with  the  emission  of  a  similar 
discrete-tuned  laser.  The  discrete-tuned  laser  is  a  laser- heterodyne,  which  consists  of  the  same 
set  of  solid  state  mini-lasers.  Due  to  the  mixing  of  the  laser-transmitter  beam  with  the  laser- 
heterodyne  beam  on  the  broad-band  photodiode  the  beat  frequency  signal  is  formed.  The  beat 
signal  carries  useful  information.  Used  photodiode  band  width  AFpd  was  approximately  equal 
to  ~  5  GHz,  that  is  considerably  less  than  the  difference  between  frequencies  of  the  separate 
lasers  AFl*  The  photodiode  selects  the  information  from  only  one  channel,  which  matches  with 
the  frequency  of  the  laser-heterodyne  optical  frequency.  Thermal  tuning  of  the  laser- heterodyne 
was  used  to  get  optical  frequency  to  operate  at  nearly  the  same  frequency  as  a  laser-transmitter 
made  of  the  material  of  the  same  composition.  The  monolithic  linear  lasers  bandwidth  was  less 
than  5  kHz.  The  heterodyne  feed-back  signal  is  used  for  the  automatic  thin  tuning  of 
frequency  of  the  laser-heterodyne.  Information  transfer  rate  using  such  multi-channel 
communication  line  with  the  family  of  solid  state  mini -lasers  on  the  base  of  GSAG/YSGG 
solid  mixtures  (AXNd  ~  3  nm)  has  the  order  of  lO^bit/sec. 

Emitting  (1)  and  heterodyne  (6) 
mini-laser  families; 

RF  modulators  (2); 
optical  fiber  connectors  (3); 
optical  fiber  line  (4); 
optical  mixer  (3); 
piezzo  elements  (7); 
temperature  stabilizer  (8); 
broad-band  photodetector  (9); 

RF  heterodyne  (10); 
tunable  RF  oscillator  (11) 
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ABSTRACT.  Photoconducting  polyimides,  phthalocyanines  and  its  polymer 
dispertions  have  been  studied  as  a  photosensitive  elements  for  liquid 
crystal  spatial  light  modulators.  High  resolution,  low  active  losses 
and  excellent  physico-mechanical  properties  are  the  main  advantages  of 
the  light  valves  investigated. 

SUMMARY.  Organic  photoconductors  are  promising  materials  for  optoelec¬ 
tronic  processes  and  devices  (1,2).  Photosensitive  characteristics  of 
polyimides,  phthalocyanines  and  its  polymer  dispertions  in  various  po¬ 
lymer  matrixes  were  investigated  by  photoconductivity,  photoelectromo¬ 
tive  force  and  electrophotographic  methods.  The  diverse  types  of  sensi¬ 
tizers  were  used  for  expantion  the  photosensitivity  to  the  red  and  near 
infrared  regions  of  spectrum.  The  best  results  were  obtained  with  phtha- 
locyanine  type  sensitizers  with  sensitivity  up  to  IOm^/J.  Phthalocyanine 
dispertions  in  nonphotoconducting  matrixes  were  also  studied  with  photo¬ 
sensitivity  in  the  visual  and  near  infrared  regions  of  the  spectra.  The 
photosensitivity  in  such  systems  is  mainly  due  to  the  photoelectronic 
processes  in  molecules  and  aggregated  forms  of  the  pigment.  The  traces 
of  the  solvent  have  strong  influence  on  the  photosensitivity  spectra. 

All  the  systems  investigated  may  be  successfully  used  in  optically  ad¬ 
dressed  liquid  crystal  spatial  light  modulators  as  photosensitive  ele*- 
ments  and  matrixes  for  dispersed  liquid  crystals. 

The  liquid  crystal  light  valves  with  organic  photoconducting  films  which 
investigated  had  high  spatial  resolution,  low  active  losses  and  excel¬ 
lent  physico-mechanical  properties.  All  the  results  obtained  are  discus¬ 
sed  as  applied  to  the  photonic  information  processing. 

I.V.Mylnikov,  A.Slusar.,  Mol.  Cryst.  Liq.  Cryst.,  1992,  v.222,  p.  125 
2y.Mylnikov. ,  Photoconductivity  cf  polymers.,  Leningrad,  Chimia,  1990 
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Abstract  The  possibility  of  self-focusing  and  diffractionless  propagation  of  light  beam  in  linear  biaxial 
gyrotropic  crystal  has  been  shown.  Asymmetrical  components  of  Kerr-like  nonlinearity  tensor  being  taken 
into  account  may  lead  to  instability  of  slow  component  of  fiber  mode  and  also  to  self-rotation  of  polarization 
plane  of  linearly-polarized  incident  light. 

An  account  of  anisotropy  is  essential  in  various  kinds  of  linear  and  nonlinear  schemes  of  Optical  In¬ 
formation  Processing.  Here  we  study  forming  of  spatial  structure  of  light  beam  in  linear  medium  with 
anisotropy  of  a  specific  kind  and  nonlinear  interaction  of  modes  in  a  fiber  with  Kerr-like  nonlinearity. 

1.  For  focusing  or  diffractionless  propagation  of  light  beams  negatively  or  sero  curved  wave  vectors  surface 
is  necessary.  Biaxial  gyrotropic  crystals  are  unique  media  in  this  respect.  For  qualitative  description  of 
these  effects  we  use  the  following  expansion  for  the  wave  vector  k  =  k,(q)n  +  q,  k,(q)  =  io«o  —  ~ 

qwq/2,  where  kM(q)  -  projection  of  wave  vector  of  parcial  wave  on  beam  axis  direction  n,  u  is  the  vector 
of  group  velocity,  w  -  curvature  tensor  of  wave  vectors  surface,  v  =  un,  no  -  phase  velocity  and  refraction 
index  in  the  direction  of  beam  axis. 

From  Maxwell  equations  with  anisotropy  and  gyrotropy  being  taken  into  account  one  may  obtain  the 
following  expressions  for  principal  values  of  curvature 


Wi  =  U»2  = 


~l  +  '? 


c0  2  —  (e3  1  “  e2  1)(c2  1  ~  el  X)> 


(1) 


*1, 2, 3-  principal  values  of  tensor  of  dielectric  perceptibility. 

The  aberrations  described  by  the  following  division  term  in  expansion  for  the  wave  vector  k  have  been 
appreciated.  They  limit  angular  aperture  0ra  of  a  spherical  part.  For  a  —  HJO3  crystal  and  for  X  =  10~6m, 
length  L  =  10~2m  the  estimation  gives  0m  =  7.1  *  10' 3  rad. 

Diffraction  properties  of  light  in  crystalls  like  a  —  HIO3  are  considered  at  gaussian  beam  example. 
Calculation  gives  the  following  expressions  for  its  width  and  radius  of  curvature: 


where  z  =  z\  4-  nozj$/2,  zi  is  the  distance  from  caustic  to  the  front  face  of  the  crystal,  2j  is  the  distance 
passed  by  light  inside  the  crystal.  From  (2)  one  can  see  if  *  =  0  beam  trapping  and  if  ♦  <  0  beam 
focusing  occurs.  In  the  last  regime,  in  dependence  on  $,z,  L  -  relationship,  may  take  place  focusing  either 
at  output  face  of  the  crystal  or  inside  the  one  or  double  focusing  both  inside  and  outside  the  crystal. 

Note  considered  effects  of  focusing  may  occur  for  o  —  HIO3  crystal  for  which  4  =  —0.175. 

2.  Interaction  of  orthogonal  linearly  polarised  modes  in  nonlinear  fibers  has  been  considered  with  asym¬ 
metrical  components  of  xs  tensor  being  taken  into  account.  E.g.,  for  Djh  symmetry  with  X12  getting  into 
account  we  obtain 
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gq  _ 

=  +  +  (2*« 1 *  I1  +xn  I  °  lJ  +fc)a>  (3-») 

db  , 

— =  Uli  —  *Xn)^’a  +  (2x66  I  o  |J  +X22  I  &  lJ  -4)4,  (3-b) 

where  k  is  proportional  to  linear  anisotropy.  Questions  concerning  stability  of  slow  mode  are  studed  below. 

We  introduce  both  the  parameters  of  anisotropy  Au  =  (x22-Xn)/2,  An  =  2x66+Xi2-(Xi;+X22)/2 
and  parameter  *J,  characterising  relative  level  of  excitation  of  orthogonally-polarised  modes.  When  x%  —  0, 
slow  mode  is  excited,  which  used  to  be  stable  without  the  account  of  Kerr-type  anisotropy  [1,2].  Now  it 
losses  stability  when  (24  -  Auj/Au  <  1,  where  I  =  4/ag,a$  -  the  total  intensity. 

Under  self-coordinate  change  of  Zq  and  aj,  when 


x 


2 

0 


A11  —  24  \ 
An  )' 


(4) 


slow  mode  keeps  the  stability  while  intensity  increases  either.  Polarisation  of  mode  is  elliptical  and  is 
changed  bom  linear  one  on  K-axe  (polarisation  of  slow  mode)  when  24  =  An  —  An  to  linear  one  on 
-X-axe  when  2 I  =  An  +  An. 

The  account  of  x?«  i»  *1»°  carried  out  in  the  study  presented.  In  that  case  nonlinear  interaction 
of  two  circular  polarised  modes  doesn’t  change  their  amplitudes,  but  affects  only  their  phase  difference 

*  =  v>+  -<*>-•■ 


=  -Xn(|a+  I1-  I  a-  1J)  *4-  X16 (l  a+  I2  +  I  a-  |2)-  (5) 

As  appears  bom  (5),  nonlinear  phase  shift  and,  therefore,  self-rotation  of  polarisation  plane,  in  com¬ 
parison  with  the  case  of  xi6  =  0,  occurs  for  linearly-polarised  incident  light  either. 
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Abstract 

We  report  the  experimental  results  of  a  64  chan¬ 
nel,  high  data  rate,  free-space  interconnection  net¬ 
work  for  massively  parallel  processing  architecures. 
It  uses  VCSEL  arrays,  photodetector  arrays  and  a 
passive  optical  routing  network. 

1  Introduction 

In  considering  the  massively  parallel  comput¬ 
ers  of  the  future,  it  is  apparent  that  there  are  sig¬ 
nificant  difficulties  in  using  electrical  interconnec¬ 
tion  methods.  Peak  data  rates  in  excess  of  100 
Gbit/sec  are  typical  for  a  single  processor  with 
a  64  or  128  bit  wide  word.  Inter-processor  data 
rates  are  presently  in  the  range  of  1-10  Gbit/Sec, 
although  clearly  this  will  increase  with  higher  pro¬ 
cessor  speeds.  Coping  with  such  data  speeds  over 
inter-board  distances,  whether  in  single  channel  or 
word-wide  buses,  is  becoming  increasingly  difficult 
with  electrical  interconnections. 

2  Goals 

Our  goal  is  to  develop  a  system  capable  of 
achieving: 

1)  Interconnection  of  10,000  processors  in  an  inter¬ 
connection  cache  architecture. 

2)  Proce8sor-to-proceasor  data  rates  of 
l-10Gbit/sec. 

Many  optical  interconnection  schemes  have 
been  suggested,  but  often  have  difficulties  of  very 
demanding  optics  (SEED  systems),  or  require 
diffractive  elements  (and  hence  wavelength  con¬ 
trol),  or  use  large  central  switches  (such  as  SLMs 
and  photorefractive  crystals)  which  do  not  scale 
well.  We  have  attempted  to  make  the  best  use 
of  the  optical  and  electronic  components  currently 
available.  The  optics  is  used  only  as  an  efficient 
method  for  communication,  while  signal  genera¬ 
tion,  routing  and  detection  is  all  done  using  elec¬ 
tronics.  Beam  combination  and  splitting  is  done 
by  simple  mirror  surfaces  (no  space-variant  redi¬ 
rection),  and  relaying  is  done  by  simple  optics,  on- 
axis  wherever  possible,  to  reduce  the  build-up  of 
aberrations. 

3  System  Overview 

Our  experimental  system  (figure  ??)  presents 
the  one-way  paths  of  the  optical  channels.  All  pos- 
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Figure  1:  Schematic  of  the  optical  system. 


Bible  routings  to  a  the  target  detector  are  shown. 
The  experiment  has  4  ‘columns’  each  with  one  sim¬ 
ulated  ‘board’  of  16  processors  driving  an  8x8  ele¬ 
ment  VCSEL  array.  Data  is  received  at  the  board 
by  a  4x4  photodiode  (PD)  array.  Note  that  in  the 
largest  possible  system,  each  processor  would  have 
one  VCSEL  array  and  a  single  photodiode  (and  the 
passive  optics  would  have  to  be  replicated  as  many 
times  as  there  are  processors  on  a  board). 

Data  is  carried  on  arrays  of  parallel  slow- 
gaussian  beams  from  the  VCSELs  and  relayed  by 
microlens  and  micromirror  arrays  and  simple  bulk 
lenses.  The  experimental  system  is  constructed  on 
a  ‘slot-rail’  system  which  allows  modular  assem¬ 
bly,  minimal  alignment  and  good  stability,  similar 
to  slot-plate  systems  already  demonstrated. 

Each  optical  channel  is  fixed  and  terminates  at 
the  receiving  photodiode  of  one  processor,  so  that 
there  are  N  channels  for  N  processors.  Each  pro¬ 
cessor  selects,  via  an  electronic  switch,  the  appro¬ 
priate  laser  that  transmits  onto  the  channel  of  the 
target  receiver  PD. 

Since  a  large  number  of  processors  must  have 
access  to  the  same  PD,  an  optical  fan-in  problem 
exists.  Fan-in  occurs  at  each  beamsplitter  cube. 
To  optimise  the  fan-in  power  efficiency,  the  reflec¬ 
tivity  at  each  beamsplitter  must  be  controlled,  so 
that,  beginning  at  the  most  distant  processor  from 
the  target,  the  coupling  efficiencies  onto  the  chan- 
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nel  are  1,  ..  for  a  fan-in  of  M.  Thia  gives 

a  -g  efficiency  for  every  connection.  However,  be¬ 
cause  all  the  channels  within  a  combining  cube  are 
generally  at  different  distances  from  their  targets, 
the  coupling  efficiencies  for  each  channel  within  a 
single  cube  are  also  different.  Hence,  micromirror 
arrays  are  used  at  the  center  of  each  cube, 
the  channels  are  kept  spatially  separate.  Th 
the  main  reason  for  the  slow-gaussian  micro! 
approach. 

The  key  advantages  of  this  system  are: 

1)  Distributed  nature  -  no  central  switch  or  control 
needed. 

2)  The  microbeam  approach  is  scalable. 

3)  Only  one  laser  per  LDA  is  on,  so  low  LDA  power 
dissipation,  reliable. 

4)  Speed  limited  by  LD/PD  technology,  electronic 
drivers  and  amplifiers. 

4  Seeding 

To  scale  up  the  experimental  system,  M  boards 
(e.g.  M  =  10’s)  are  to  be  added  in  each  column, 
giving  a  column  channel  efficiency  of  -fa .  20  boards 
results  in  a  5%  efficiency,  sufficient  for  our  tar¬ 
get  data  rates  with  lmW  lasers.  However,  whole 
columns  must  also  be  cascaded  to  get  the  target 
processor  numbers.  Therefore  channel  efficiency 
drops  rapidly  as  columns  are  added.  This,  along 
with  the  difficulty  of  maintaining  alignment  and 
aberration  means  that  ‘repeaters’  will  be  necessary 
every  few  columns.  These  may  be  arrays  PD/LD 
hybrids,  devices  with  gain  (such  as  VSTEPS)  or 
solid  state  amplifiers. 


5  Optics 

Diffraction  limits  the  dimensions  and  hence  the 
total  number  of  distinct  microbeams  that  can  be 
supported  within  a  cube.  The  relation  giving  this 
maximum  number,  Nmmx ,  is 


— 


rnD 
4  Am2 


(1) 


where  n  is  the  cube  refractive  index,  D  the  cube  di¬ 
mension  and  m  the  multiplier  for  beam  pitch  2mr0 
(where  r0  is  the  jj  radius)  at  the  cube  surface.  Fig¬ 
ure  ??  shows  the  relation  between  Nmax  and  X  for 
m  =  1.5  at  various  cube  sixes.  It  illustrates  the 
high  numbers  of  channels  that  can  be  used  with 
compact  (<30mm)  optics. 

In  our  experiment,  the  850nm  beams  emitted 
from  VCSEL  arrays  have  a  250pm  pitch,  and 
are  collimated  by  ion-exchange  planar  microlenses. 
Some  micromirror  arrays  are  conformal  HOEs 
made  in  dichromated  gelatin,  with  controlled  ef¬ 
ficiencies  and  optimised  for  850nm,  and  some  are 
evaporated  patterned  metal. 

Board- to- board  relaying  (20-50mm)  within  a 
column  is  done  by  /  =  14mm,  16  level  diffractive 
microlens  arrays  with  a  typical  efficiency  of  90%. 


Figure  2:  Max  number  of  channels  for  m  =  1.5. 


The  microlens  arrays  are  bonded  to  the  surface  of 
each  cube.  The  distance  from  column  to  column  is 
larger  («  300mm  for  a  board  width),  and  conven¬ 
tional  bulk  4  ‘  can  be  used  in  a  4-f  imaging 

arrangeme  , ,  a  <  Individual  beams  have  ex¬ 
tremely  smat.  numerical  apertures  (NAs),  easing 
imaging  requires  >  is 

The  theoretical  aberrations  of  this  system  are 
very  low,  because  NAs  are  very  low  (<  0.01), 
and  much  of  the  imaging  is  on-axis.  Exact  ray¬ 
tracing  simulations  around  the  worst  case  path  of 
the  experimental  system  shows  negligible  aberra¬ 
tions  (strehl  ratio  «  1).  The  telecentric  4-f  bulk 
lens  imaging  system  adds  only  significant  spherical 
aberration,  which,  in  our  case,  results  in  a  beam 
pointing  and  position  error.  Simulations  show  that 
for  our  standard  40mm  lenses  at  full  7mm  field,  the 
pointing  error  is  only  0.025®  and  position  0.69pm. 
For  a  full  size  system  using  80mm  lenses  and  20mm 
cubes,  full  15mm  field  has  only  0.028®  and  2.1pm 
errors.  After  8-f  errors  are  0.057®  and  8.6pm,  in¬ 
dicating  repeaters  could  be  placed  every  2  or  3 
columns.  Optimized  lenses  would  give  improved 
performance,  potentially  allowing  greater  separa¬ 
tion. 

Current  experimental  results  will  be  reported. 


a — i — b - 1 — i — i - a — i — h 


Figure  3:  OSLO  model  of  system,  and  longest  path 
PSFs  at  1.4  (operating)  and  5mm  field  (1.4mm 
patches). 
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Abstract 

Initial  results  on  the  performance  of  a  64  input  64  output  free-space  optical 
crossbar  are  reported.  The  components  of  the  compact  and  rugged ised  system 
are  described. 

The  use  of  free-space  optics  to  perform  high  bandwidth  connections  for  computing 
applications  has  been  widely  discussed  in  the  literature.  In  this  paper  we  present  the 
practical  results  from  a  64  x64  free-space  optical  crossbar  switch  which  has  been  developed 
as  part  of  a  collaborative  project  entitled  OCPM  [1]. 

Figure  1  shows  the  layout  of  the  64  x64  system.  A  matrix-matrix  crossbar  design  has 
been  used  and  data  inputs  to  the  switch  are  provided  by  790nm  laser  diodes  pigtailed  to 
polarisation  preserving  fibres.  To  create  the  most  compact  system,  the  design  uses  one- 
to-one  imaging  and  hence  the  input  fibres  are  arranged  in  an  array  that  matches  the  pixel 
spacing  of  the  SLM.  These  are  80pm  cladding  diameter  Hi-Bi  fibres  which  are  arranged  in 
an  8x8  square  array  with  a  120pm  pitch.  Each  fibre  has  its  polarisation  axes  orientated  in 
the  same  direction.  This  fibre  array  was  fabricated  in  an  array  of  holes  formed  by  excimer 
laser  material  ablation  of  a  Kevlar  substrate  [2].  The  light  from  the  input  array  is  fanned 
out  by  means  of  a  binary  phase  grating  based  on  a  non-separable  two-dimensional  design. 
A  half-wave  plate  is  used  to  align  the  polarisation  of  the  channels  with  the  liquid  crystal 
axes  of  the  SLM. 

The  relay  lenses  have  effective  focal  lengths  of  ~40mm  and  a  16°  full  field  of  view. 
Code-V  has  been  used  to  design  and  optimise  the  system  and  predicts  that  90%  of  the 
input  falls  within  10.5pm  diameter  circles  after  two  relay  stages  [3].  The  contrast  ratio  of 
the  polarising  beam  splitter  limits  the  possible  signal  to  crosstalk  ratio  of  the  switch.  Due 
to  the  large  range  of  working  angles  it  was  necessary  to  incorporate  a  second  polariser  in 
the  output  arm  of  the  crossbar.  Alignment  of  the  components  is  achieved  by  means  of 
accurate  positioning  of  the  optics  within  cylindrical  mounts  which  are  held  in  place  by 
magnetic  strips  within  channels  machined  into  the  baseplate.  Beam-steering  prisms  are 
used  in  the  output  arm  of  the  crossbar  to  provide  fine  adjustment  of  the  optical  axis. 

The  64  x64  SLM  is  currently  being  fabricated  for  inclusion  in  the  system  at  a  later 
date.  This  device  will  have  ~70pm  square  pixels  on  a  120pm  pitch.  It  will  use  a  chiral 
smectic  C*  liquid  crystal,  operating  in  the  binary  latching  bookshelf  structure.  This  will 
be  an  active  backplane  device  with  DRAM  pixel  circuitry  and  a  switching  speed  of  ~10 
-  25ps  is  expected.  Until  this  device  becomes  available,  initial  system  tests  are  being 
performed  with  an  SLM  of  the  same  dimensions  but  containing  just  64  pixel  stripes. 
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Figure  1:  Lay-out  of  the  64x64  crossbar  baseplate  and  optics 


The  fanning-in  of  the  light  into  the  output  fibres  is  performed  by  two  lenslet  arrays. 
These  have  been  combined  to  form  a  single  high  efficiency  holographic  element.  To  in¬ 
crease  the  collection  efficiency  of  the  fanning-in  to  the  output  fibres,  300pm  core  diameter 
fibre  is  used.  These  fibres  are  formed  into  a  square  8x8  array  with  a  960pm  pitch  using 
an  array  of  holes  that  are  anisotropically  etched  in  silicon. 

The  technology  for  fabricating  compact,  high  bandwidth,  non-blocking  free-space  op¬ 
tical  crossbar  switches  capable  of  full  broadcast  and  multi-cast  has  been  developed  and 
demonstrated.  The  performance  of  this  64x64  crossbar  for  switching  data  up  to  a  band¬ 
width  of  lGbit/s  is  to  be  evaluated  and  the  subsequent  results  will  be  presented  at  the 
conference. 
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We  demonstrate  a  representative  portion  of  an  optical  backplane  using  free-space 
optical  channels  to  interconnect  printed  circuit  boards  which  employ  FET-SEED  based  smart 
pixel  arrays.  Results  of  system  demonstrator  performance  will  be  presented. 


Free-space  optical  interconnects  represent  a  solution  to  the  needs  of  future  connection 
intensive  digital  systems  such  as  ATM  switching  systems,  and  massively  parallel  processing 
computer  systems.  These  systems  will  require  the  large  board-to-board  connectivity  provided 
by  an  optical  backplane  created  with  two-dimensional  arrays  of  passive,  free-space.  Parallel 
Optical  Chaimels  (POCs)  to  optically  interconnect  electronic  Printed  Circuit  Boards  (PCBs) 
and/or  Multi-Chip  Modules  (MCMs).  Such  a  backplane  could  be  capable  of  supporting 
terabit/second  aggregate  capacities  with  connectivity  levels  on  the  order  of  10,000 
input/output  channels  per  PCB. 


FRTSF.F.n  T™n{mjprr/Receiwrr 

We  have  developed  the  optics  and 
optomechanics  to  demonstrate  these  high  bit 
rate  optical  backplanes.  As  part  of  this 
program,  we  have  constructed  a 
representative  portion  of  an  optical 
backplane  capable  of  interconnecting  two 
printed  circuit  boards  which  utilize  FET- 
SEED  based  smart  pixel  transceiver 
arrays.*1)  Figure  la  shows  a  schematic  of 
the  transmitter  circuit,  including  the  optical 
diodes  and  the  FETs  used  to  modulate  the 
optical  input  Figure  lb  is  a  schematic  of  the 
receiver  circuit  including  the  optical  input 
windows,  clamping  diodes  for  voltage 
control,  the  differential  amplifier  section  of 
the  circuit  and  the  power  FETs  which  are 
designed  to  drive  100  ohm  transmission 
lines  on  die  PCBs.  Both  the  4  x  4  transmitter 
and  receiver  array  optical  windows  are  25  x 
25  pm,  separated  by  50  pm,  with  the  pixel 
to  pixel  pitch  being  set  at  200  pm.  These 
circuits  are  designed  to  run  at  155  MBits/sec 
simulation  of  the  receiver  circuit  indicating 


i  parallel.  Figure  2  shows  the  results  of  a  Spice 
cut-off  frequency  of  1.4  GHz.  Experimentally 
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measured  device  and  circuit  parameters  will  be  presented 
including  measurements  of  not  only  individual  pixel 
performance  but  also  the  4  x  4  array  speed  and  efficiency. 
These  results  will  be  compared  to  the  simulator  results  to 
confirm  the  accuracy  or  our  models. 


€IWIK 


The  board-to-board  interconnection  will  be 
accomplished  using  a  two-sided  optical  backplane 
approach,  shown  in  Figure  3.®  This  first  demonstrator  is 
capable  of  interconnect  two  PCBs  using  the  4  x  4  smart  Frequency  (Hi) 

pixel  arrays  described  above.  Specifically,  light  from  the  „  _  _ 

output  of  an  argon  laser  pumped  Ti:Sapphire  laser  is  Figure  l-  Receiver  Circuit  Respocue 

delivered  and  collimated  using  single  mode  fibers  and 
collimating  optics.  A  binary  phase  grating  is  used  to 

provide  spot  array  generation  for  illuminating  the  smart  ki.mcm  I  l.**,  Hi 
pixel  array  optical  windows.  Vertically  polarized  light  is  V  »«  H 

reflected  off  the  polarization  beam  splitter,  directed  /rm 

through  a  quarter  waveplate,  and  focused  onto  the 
transmitter  array.  The  transmitter  circuit  modulates  the 
light,  reflecting  the  light  back  towards  the  adjacent 

printed  circuit  board  through  the  polarizing  optics.  The  VnjjjjjjjjjjjMpV 

modulated  light  is  detected  by  the  receiver  smart  pixel 
photodiodes,  and  amplified  using  power  FETs.  These  T2T  I  I 

signals  are  then  directed  onto  the  PCB  for  appropriate  1  | 

processing.  The  optomechanics  for  this  first  demonstrator  ^  ^ 

is  based  on  a  modified  AT&T  baseplate  approach.  (3)  The  _.  „  ^ 

baseplate  is  made  from  Magnesium  to  insure  extreme  Figure  3:  Demonstrator  Schematic 

flatness  tolerances.  The  baseplate  design  is  capable  of 

implementing  both  a  bulk  optics  approach,  and  a  lenslet  array  based  approach  for  creating  the 
passive  optical  channels  used  to  interconnect  the  PCBs.  Tolerancing  issues  such  as  optical 
component  alignment  and  PCB  optomechanics  will  be  discussed. 

Results  of  system  measurements  will  be  presented  and  discussed.  These  measurements 
include  characterization  of  optical  throughput,  die  associated  optical  efficiency,  and  spot  array 
uniformity.  In  addition  optical  crosstalk  and  bit  error  rates  will  be  presented.  Finally,  short 
and  long  term  stability  will  be  described  with  an  emphasis  being  placed  on  the  ultimate 
manufacturablity  of  this  approach  being  addressed. 


Figure  3:  Demonstrator  Schematic 
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Abstract 

An  S-SEED  switching  experiment  has  been  designed  and  constructed  to  investigate  operation  at  5-50  MHz 
using  powers  of  1  mW/Uevice  over  a  512  device  array.  Details  of  the  implementation  and  experimental  results 
will  be  presented. 


Summary 

Arrays  of  optically  interconnected  self-electrooptic  effect  devices  (SEEDs)  have  been  proposed  for  use  in  optical 
computing  systems  to  overcome  the  problems  encountered  in  high  speed  electronic  systems;  e.g.  clock-skew, 
limited  interconnect  possibilities,  cross-talk,  etc.  Individual  S-SEED  devices  have  been  switched  at  130  MHz 
[1]  and  the  degradation  in  their  performance  that  occurs  at  high  powers  has  been  measured[2].  Large  arrays 
have  only  been  operated  at  below  1  MHz [3].  It  is  important  to  demonstrate  the  system  capabilities  of  these 
devices  by  combining  these  two:  operation  of  a  large  array  at  high  speeds.  ITus  investigation  into  the 
implementation  of  high  speed,  highly  parallel  switching  has  revealed  the  practical  difficulties  described  below. 

We  have  used  a  1  W  cw,  argon  ion  pumped  Ti-sapphire  laser  operated  at  830  nm  and  an  electro-optic 
modulator  to  switch  a  single  16  by  32  array  of  GaAs/GaAlAs  symmetric-SEEDs  (S-SEEDs)[4J.  The  set-up 
used  is  shown  in  the  figure  and  photograph.  The  optics  in  this  system  are  kinematically  mounted  on  a  custom 
slot  baseplate.  The  electro-optic  modulator  (EOM)  and  two  polarising  beamsplitters  define  separate  routes  for 
the  s  and  p  polarisations,  producing  the  set  and  reset  beams.  These  two  angularly  separated  beams  pass 
through  a  16  by  32  binary  phase  grating  array  generator  and  are  relayed  to  the  8.4  mm  focal  length  objective 
lens  that  focuses  the  beams  into  5  pm  spots  on  the  device  windows.  The  design  is  telecentric.  A  fast  avalanche 
photodiode  system  and  high  bandwidth  oscilloscope  were  required  to  detect  the  reflected  beams  from  the  SEED 
devices. 

The  requirement  far  high  power  in  a  single  mode  prechides  die  use  of  an  easily  modulated  diode  laser, 
necessitating  the  use  of  an  externally  modulated  large  frame  laser.  The  external  modulator  had  the  highest 
optical  power  handling  and  speed  specification  available.  It  could  rotate  the  polarisation  to  produce  a  pseudo¬ 
square  wave  at  50  MHz.  which  resulted  in  a  contrast  ratio  for  the  beams  incident  on  the  SEEDs  of  6:1. 
However,  even  with  a  beam  as  large  as  the  1  mm  aperture  allowed,  significant  photo-refractive  damage 
accumulated,  producing  a  strong  focusing  effect 

Signal  encoding  is  a  major  issue  that  is  best  addressed  by  directly  modulating  the  laser  itself.  Current 
development  of  master  oscillator  power  amplifier  (MOPA)  devices[5]  is  resulting  in  compact  lasers  with  high 
power  single  mode  performance  that  should  soon  be  configurable  for  high  speed  modulation.  Our  experience 
confirms  that  the  problems  of  beam  drift,  cavity  alignment  and  the  difficulties  of  modulation  exclude  the 
practical  use  of  gas  lasers  for  this  type  of  system  demonstration.  This  laser  was  discovered  to  operate  multi¬ 
longitudinal  mode  with  beats  at  multiples  of 230  MHz,  which  clearly  diminishes  the  visibility  of  high  frequency 
switching.  Any  misalignment  of  the  cavity  degrades  the  beam  profile  and  introduces  transverse  mode  beating 
at  80  MHz  as  well  as  misaligning  the  modulator  and  hence  lowering  the  contrast  ratio. 

Operation  to  date  has  been  limited  by  the  EOM  and  laser.  Half  of  the  array  has  been  observed  switching  at 
5  MHz  (1.25  Gbit/s)  and  individual  pixels  at  10  MHz.  Operation  at  up  to  50  MHz  is  anticipated. 
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Figure  1  Schematic  Layout 
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Figure  2  Photograph  of  baseplate  and  optics 
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ABSTRACT 

Two  dimensional  (4x18)  arrays  of  Field  Effect  Transistor-Self  Electrooptic  Effect  Device  transimpedance 
receivers  have  been  fabricated  for  application  in  massively  parallel  optical  data  links.  Up  to  lOOMbps/channel 
was  demonstrated.  Test  results  are  discussed. 


SUMMARY 

Architectures  for  large  switching  systems  ate  constrained  by  the  limitations  of  available  interconnections  at  the 
chip  and  board  level.111  P1  Two  Dimensional  Optical  Data  Links  (2D-ODLs)131  [4)  could  provide  a  low  cost,  high 
bandwidth,  high  density,  interconnection  scheme  and  would  open  up  new  avenues  to  architects  of  these  systems. 
Key  components  used  to  construct  2D-ODLs  are  a  modulator  or  source  array,  a  2D-fiber  bundle,151  and  a  receiver 
array.  This  talk  describes  the  design,  and  experimental  results  of  4x18  differential  transimpedance  receiver 
arrays  (Figure  1)  fabricated  in  the  Field  Effect  Transistor-Self  Electrooptic  Effect  Device  (FET-SEED)161 
technology.  The  FET-SEED  technology  consists  of  multiple  quantum  well  (MQW)  reflection 
detectors/modulators171  (S1  monolithically  integrated  with  doped  channel  MIS-like  Field  Effect  Transistors 
(DMTs). 


Figure  1.  Photomicrographs  of  4x18  FET-SEED  differential  receiver  circuit  and  array 
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The  receiver  schematic  and  amplifier  response  histogram  are  shown  in  Figure  2.  Several  4x8  FET-SEED 
transimpedance  receiver  arrays  were  tested  and  displayed  a  mean  response  of  -0.7mV/|iW,  and  were  capable  of 
>100Mbps  per  channel  operation.  Eye  diagrams  for  40Mbps  and  100Mbps  operation  is  shown  in  Figure  3.  The 
mean  receiver  sensitivity  for  a  BER  of  <10~9  at  the  design  rate  of  40Mbps  was  -25dBm,  and  was  found  to  incur 
a  1/f  noise  penalty.  These  receiver  arrays  were  successfully  used  in  a  demonstration  of  a  Small  Computer 
System  Interface  (SCSI)  fully  differential  parallel  optical  data  link.  In  this  talk,  we  will  compare  the  measured 
bit  error  rate,  crosstalk,  and  noise  to  theoretical  values,  and  discuss  the  limitations  in  fabricating  large  arrays  of 
[XT  coupled  receivers. 


■ 

9 

m 

i 

“C 

£ 


RESPONSIVITY  HISTOGRAM 
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Figure  2.  Differential  transimpedance  amplifier  schematic  diagram,  and  responsivity  histogram 


Figure  3.  Eye  diagrams  for  40  and  100Mbps  operation 
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Abstract:  Some  of  the  achievements  of  National  863  High  Technology  Program  in  China 
are  summarised. 

1.  Introduction 

The  national  863  High  Technology  Program,  which  was  proposed  and  drew  up  by  scientists, 
has  been  implemented  in  China  since  March  1986.  Under  the  program  there  are  several 
groups  according  different  scientific  fields.  There  is  Optical  Computing  in  the  Group  of 
Opto-dectronic  Devices  and  Integration,  which  incorporates  more  than  twenty  universities 
and  institutes  that  have  made  achievements  in  optical  computing.  The  several  areas  of 
optical  computing  are  bong  implemented  as  follows. 

2.  Devices 

The  S-SEED  and  S-SEED  Array  have  been  made  and  applied  to  optical  computing.  Devices  as 
Multiple  Quantum-Well-Based  Lasers(MQW),  Top  Surface  Emitting  Vertical  Cavity  Diode 
Laser,  Multi-Channel-Splinder,  Micro  Lens  Array  ami  the  related  materials  (include 
semiconductor  and  crystal)  for  these  devices  have  been  also  developed. 

In  the  research  on  real  time  Spatial  Light  Modulators(SLM),  the  liquid  Crystal  Light  Valve 
(LCLV)  and  Electrically  Addressed  SLM  that  closely  coupled  with  CRT,  Ferro  electric 
Liquid  Crystal  Valve  and  Pockel-Read  Optical  Modulators(PROM)  have  been 
implemented. 

3.  Architecture  and  System 

The  optoelectronic  hybrid  processor's  array  consists  of  three  2DPAs  (2-D  Processors  Array), 
which  has  3-D  architecture.  A  programmable,  electrically  addressed  optical-fiber  inter¬ 
connection  network  has  been  employed  between  each  two  2DPA.  This  system  can  be  used  tor 
parallel  processing. 

In  the  research  on  Morphological  processors,  the  varied  operations  in  image  algebra  are 
realised  by  means  of  hardware,  which  have  been  applied  to  image  processing.  The 
demonstration  of  image  processing  in  real  time  has  beat  made  by  Optical  Cellular  Array 
processors  developed. 


The  adaptive  processors  that  based  on  the  muhi-Channel  JTC  System  and  optical  disc 
memory  system  have  been  developed  to  carry  out  the  image  correlation  processing. 
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The  modules  of  interconnection  and  parallel  processing  system  based  on  S-SEED  will  be 
used  as  the  demonstration  system  of  opto-digital  computing  and  a  kind  of  special-purpose 
system. 

4.  Neural  Network  Processors 

In  the  research  on  optoelectronic  hybrid  neural  network,  more  than  1000  neural  cells  have 
been  reached  for  optical  computing,  where  the  threshold  is  done  by  electrical  computers  to 
implement  varied  kinds  of  operations. 

In  the  studies  of  optoelectronic  hybrid  associative  memory,  the  storage  devices  are  made  up 
with  the  new  kind  of  Crystal,  KNSBN,  a  Liquid  Crystal  screen  with  changeable  thresholds  is 
employed  as  a  device  of  threshold  and  gain.  Associativity  of  the  system  is  as  high  as  75%. 
The  studies  of  associative  storage  system  with  multi-channels  are  going  on  to  increase  the 
processing  speed. 

5.  The  Fiber  Network  of  Computers 

The  optical  fiber  network  of  multiple  micro  computers  has  been  studying  and  been  in 
progress.  The  products  of  this  kind  of  system  with  the  resources  shared  by  multi-computers 
will  be  developed  soon. 

6.  Conclusions 

(1) under  the  united  organisation  all  the  studies  are  cooperately  going  on. 

(2) There  are  some  differences  between  the  National  Nature  Academic  Fund,  which  mainly 
supports  the  studies  of  areas  such  as  the  new  ideas,  new  architecture's  and  new  kinds  of 
devices  etc.,  and  National  863  Program,  winch  lays  particular  emphasis  on  the  creative 
researches  in  practical  applications. 

(3) The  first  stage  of  this  program  will  end  by  2000.  When  the  time  comes,  there  will  be  some 
available  optical  computing  systems. 
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I.  INTRODUCTION 

Integrated  optical  devices  such  as  surface  emit¬ 
ting  laser  array,  photo  detector  array  and  OEICs 
promise  high  performance  parallel  processing.  Al¬ 
though  these  devices  have  two-dimensional  paral¬ 
lelism  for  pattern  information  processing,  their  po¬ 
tential  capabilities  are  not  utilized  in  conventional 
parallel  processing  systems.  The  ”1/0  bottleneck” 
between  processing  element  (PE)  array  and  I/O 
devices  sets  the  limit  to  the  processing  speed. 

In  order  to  overcome  the  I/O  bottleneck,  two- 
dimensional  interconections  between  them  are  re¬ 
quired.  The  interconnections,  however,  can  not 
be  implemented  by  using  conventional  macro-scale 
wiring  technology  even  if  the  integrated  optical  de¬ 
vices  are  used.  Only  the  integrated  wiring  tech¬ 
nology  that  is  VLSI  technology  can  implement  the 
interconnections.  It  means  that  the  processing  ele¬ 
ment  should  be  so  compact  to  be  implemented  with 
integrated  optical  devices.  Integrated  PE  and  op¬ 
tical  device  array  realizes  fully  parallel  processing 
with  high  computing  performance. 

In  this  paper,  architectures  of  compact  PE  ar¬ 
ray  and  experimental  systems,  some  experimental 
results  on  the  system  performance  for  applications, 
and  a  design  of  an  intergrated  one  chip  parallel  pro¬ 
cessing  system  are  shown. 

II.  PARALLEL  OPTOELECTRONIC 
PROCESSING  SYSTEMS 

A.  Design  of  processing  element 

Conventional  microprocessors  can  not  be  used 
as  the  PE  of  the  system,  because  the  number  of 
gates  is  too  large  that  many  PEs  cannot  be  im¬ 
plemented  into  small  area  on  VLSIs.  Keys  of  the 
design  are  how  generality  of  processing  is  kept  by 
using  8 mall  number  of  gates  and  how  integration 
and  high  speed  processing  is  performed. 

Ishikawa  et  al.  have  proposed  a  compact  PE  ar¬ 
chitecture  with  general  purpose  and  programable 
functionality  and  implemented  a  scale-up  model  [1]. 
A  conceptual  diagram  based  on  the  architecture 
with  optical  interconnection  is  shown  in  Fig.l. 

The  architecture  of  the  PE  has  following  features 
to  be  compact;  1)  direct  connection  between  PE 
and  optical  I/O,  2)  SIMD  type  parallel  processing, 


1  proc— mg  clement 


Fig.  1.  Parallel  optoelectronic  processing  system 


3)  controlled  by  microinstruction,  4)  bit  serial  pro¬ 
cessing,  5)  restricted  electrical  interconnections  (4 
neibours). 

Each  PE  has  three  8bit  registers,  one  arithmetic 
logical  unit  (ALU,  lbit),  one  4bit  multiplier  as 
shown  in  Fig.  2.  Although  bit  serial  processing  is 
slow  in  comparison  with  bit  parallel  processing,  it 
is  so  compact  to  realize  the  integration  and  variable 
bit  length  processing. 

In  the  result,  the  PE  is  implemented  by  using 
337  gates.  The  number  of  the  gates  is  quite  small. 

B.  Scale  up  model  :  SPE-jk 

An  experimental  optoelectronic  processing  sys¬ 
tem  which  has  matrix  positioned  64  x  64  =  4096 
(4k)  PEs  is  implemented  by  using  gate  array  tech¬ 
nology.  The  system  is  named  SPE-4k  (sensory  pro¬ 
cessing  elements  -  4k)  and  is  designed  as  a  scale  up 
model  of  integrated  one  chip  optoelectronic  pro¬ 
cessing  systems  shown  in  Fig.  1.  The  system  uses 
LEDs  and  PTRs  for  optical  I/O. 
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Fig.  2.  Block  diagram  of  processing  element 


The  cycle  time  of  the  system  is  100ns.  Consider¬ 
ing  the  8bit  integer  addition  as  a  basic  operation  of 
processing  for  the  evaluation  of  the  speed  of  SPE- 
4k,  3.2GOPS  (Giga  Operations  Par  Second)  at  the 
maximum  speed  of  the  system  are  obtained. 

C.  Reconfigurable  optical  interconnection 

Shift-invariant  and  reconfigurable  optical  inter¬ 
connection  between  the  PEs  of  SPE-4k  using 
Fourier  plane  computer  generated  hologram  (CGH) 
has  been  proposed  and  demonstrated  by  using  a 
parallel  aligned  nematic  liquid  crystal  (PAL)  sap- 
tial  light  modulator  (PALSLM)  by  Kirk  et  al.  [2]. 
Since  the  CGH  is  optimized  and  the  PALSLM  is 
a  phase  modulation  type  spatial  light  modulator, 
high  diffraction  efficiency  and  accuracy  have  been 
obtained. 

Recently  a  laser  diode  array  for  integration  of  the 
optical  interconnection  and  Fourier  plane  hologram 
for  real-time  generation  of  the  interconnection  ker¬ 
nel  are  examined  and  practical  algorisms  using  op¬ 
tical  interconnection  for  parallel  processing  opera¬ 
tions  such  as  prefix  calculation  and  vector-matrix 
multiplication  are  also  examined. 

D.  One  chip  system 

The  number  of  transisters  can  be  reduced  by  us¬ 
ing  full  custom  VLSI  design  rule.  Yam&da  and 
Ishikawa  have  designed  and  implemented  one  chip 
version  of  PE  array  with  photo  array  (3j.  In  their 


TABLE  I 

Processing  Time  of  Applications 


Processing 

Steps 

Time 

Edge  detection  (binary) 

(ExOR  with  4  neghboun) 

33 

3.3xs 

Skettcnixatioc 
(iteration,  4  neighbours) 

149 

149.0ns 

Gaussian 

111 

11. 1X8 

Matched  Filter  (binary) 

89 

8.9xs 

Optical  flow 
(iteration) 

672 

13.44as 

Poisson's  equation 
(iteration) 

125 

2.5ns 

chip,  3600  PEs  with  the  same  number  of  photo  de¬ 
tectors  are  integrated  as  shown  in  Fig.  3. 

III.  APPLICATIONS 

This  type  of  parallel  processing  is  matched  with 
early  stage  of  visual  information  processing.  In  or¬ 
der  to  evaluate  processing  performance  of  the  SPE- 
4k,  some  applications  have  been  carried  out  on  the 
system.  The  processing  time  of  the  applications  is 
shown  in  Table  I. 

In  principle,  input  pattern  is  binary  and  calcu¬ 
lation  is  in  fixed  point.  A  few  applications  in  Table 
I  use  iterative  algorism,  so  in  such  applications,  the 
processing  time  depends  on  the  iteration. 

IV.  CONCLUSION 

Optoelectronic  processing  architectures  for  real¬ 
izing  a  integrated  optoelectronic  systems  and  some 
experimental  results  are  shown. 

In  the  SPE-4k,  compact  processing  elements  re¬ 
alize  general  early  vision  processing  and  high  speed 
visual  feedback.  In  addition,  it  is  shown  that  3600 
PEs  with  the  same  number  of  photo  detectors  can 
be  implemented  into  one  chip. 
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Abstract:  We  have  constructed  a  simple  4-bit  counter  using  holographic  interconnects, 
microlens  arrays,  vertical  cavity  lasers,  and  a  CMOS  detector  array  chip.  Performance  of  this 
prototype  system  will  be  discussed. 

We  are  working  on  various  architectures  for  three-dimensional  optoelectronic  computers. 
They  all  share  free  space  holographic  interconnects  from  one  optoelectronic  processing  board 
to  another.  We  will  describe  die  results  of  an  experiment  to  construct  a  system  using  this 
technology.  This  is  also  the  first  step  in  the  development  of  a  testbed  for  evaluation  of  source 
and  detector  arrays,  holograms  and  die  necessary  optomechanics. 

Figure  1  is  a  schematic  of  the  experiment  The  system  implements  a  simple  4-bit  counter 
with  the  right  (left)  half  of  the  system  counting  the  odd  (even)  numbers.  For  the  top  beams, 
there  are  four  pairwise  optical  OR's  performed  by  the  receiver  detectors.  The  electronics  are  a 
few  discrete  TTL  gates  that  arc  used  to  implement  the  Boolean  expressions  for  the  four  bits  of  a 
counter.  On  each  clock  signal  the  right  (left)  counts  1,  3,  5,  7,  9,  11,  13,  15,  1...  (0,  2, 4,  6, 
8,  10,  12,  14,  0...)  etc. 

A  pair  of  Vertical  Cavity  Surface  Emitting  Laser  arrays(VCSELs),  a  1x8  and  a  1x4, 
emitting  at  842  nm,  are  the  sources.  The  VCSEL  output  powers  vary  from  550  pW  to  900 
pW,  with  bandwidths  in  the  GHz.  The  VCSEL  is  cooled  by  a  thermoelectric  cooler  to  control 
the  wavelengths. 

A  Nippon  Sheet  Glass  company  microlens  array  (pLA)  with  a  560  pm  focal  length  is 
glued  dirccdy  to  the  VCSEL  package  while  actively  monitoring  the  beam  outputs.  The  pLA 
collimates  the  Gaussian  output  beams  of  the  VCSELs. 

Each  of  the  4  or  8  channels  illuminates  a  separate  computer  generated  hologram  (CGH), 
which  creates  the  desired  interconnect  pattern.  These  are  designed  using  a  modified 
Gerchberg- Saxton  algorithm.  Two  types  of  elements  are  produced  from  this:  the  first  is  a 
standard  etched  binary  (two  level)  phase  grating  in  quartz.  The  second  is  a  phase  copy  of  the 
amplitude  mask  recorded  at  5 14  nm  in  Dupont  photopolymer.  The  performance  characteristics 
of  these  approaches  will  be  presented.  The  hologram  array  (HA)  is  glued  to  the  pLA  while 
actively  monitoring  the  beam  outputs. 

The  transform  lens  is  a  standard  laser  diode  lens  anti-reflection  coated  for  830  nm. 

Standard  2  pm  process  CMOS  chips  made  by  the  MOSIS  chip  fabrication  service  are 
used  as  2x8  pn  diode  detector  arrays  with  200x200  pm2  detection  area.  Each  channel  has 
amplifiers  and  a  TTL  level  output  driver.  The  receivers  require  optical  pulse  power  >40  pW  to 
generate  an  electrical  output  pulse.  These  outputs  are  wired  to  the  TTL  chips. 

All  array  elements  are  on  a  250  pm  pitch. 
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The  components  arc  mounted  with  Spindler  and  Hoyer  microbench  parts  with  the 
VCSEL/pLA/HA  component  mounted  on  a  slug  held  by  the  standard  x-y  adjustable  stage  and 
similarly  for  the  receiver  chips.  There  are  two  independent  optical  paths  as  shown  in  the 
figure. 

The  system  speed  is  limited  by  the  minimum  power  required  for  the  receiver  chips  to 
generate  an  output  pulse.  This  in  turn  depends  on  the  efficiency  of  the  holograms,  and  the 
VCSEL  output  powers.  Another  issue  relating  to  speed  is  the  absorption  depth  of  the  light 
We  measured  the  speed  and  sensitivity  of  the  detector  chips  at  both  830  nm  and  at  670  nm  to 
separate  the  circuit  speed  limitations  from  the  diffusion  time  delay  for  carriers  absorbed  deep 
inside  the  silicon  substrate.  The  implications  of  these  interconnected  factors  will  be  discussed 
in  terms  of  the  requirements  placed  on  each  element  for  reliable  system  design. 

The  complete  system  performance  and  implications  for  larger  systems  will  be  discussed. 


Micro  Holograms 

VCSRI  *  Uns  1  ~  Receivers 


Receivers  |  Micro  VCSELs 

Holograms  lens 
Figure  1.  Experimental  schematic. 
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1  Abstract 

An  optical  image  processor  with  optical  feedback  and  gray  scale  capability  compactly  organized  around 
bistable  binary  amorphous  silicon  and  ferroelectric  liquid  crystal  devices  and  a  one  lens  correlator  using 
reflective  multiplexed  photopolymer  hologram  is  presented. 

2  Summary 

A  broad  class  of  efficient  and  widely  used  low  level  image  processing  operations  are  the  non-linear  filters 
such  as  rank-ordered  filtering  and  morphological  operations.  They  can  be  performed  on  binary  images 
by  convolving  with  a  binary  or  gray  scale  kernel  and  thresholding  the  output.  Extension  to  gray  scale 
images  is  performed  by  considering  them  as  a  finite  collection  of  binary  slices,  processing  each  slice  as 
previously  and  finally  summing  the  processed  slices  to  obtain  the  processed  gray  scale  image  [lj. 

As  they  involve  mainly  a  correlation,  these  processes  may  be  advantageously  implemented  optically 
with  thresholding  binary  optoelectronic  devices.  As  even  the  simplest  processes  involve  many  sequential 
basic  operations,  a  strictly  parallel  optical  feedback  is  necessary.  Pipelining  of  optical  processors  quickly 
becomes  very  expensive  and  using  electronic  feedback  introduces  an  10  bottleneck  which  destroys  the 
advantage  of  optics.  The  correlator  must  be  multiplexed  or  quickly  reconfigurable.  The  2D  optoelectronic 
devices  must  have  memory  capability  to  control  the  data  flow  in  the  optical  loop,  cascadable  and  able 
to  perform  a  sharp  thresholding  at  a  programmable  level.  Additionally,  the  processor  must  be  compact 
and  inexpensive.  Under  such  stringent  constraints,  very  few  realistic  solutions  have  been  found.  Here  we 
propose  a  system  using  recently  available  devices. 

Optically  addressed  spatial  light  modulators  (OASLM)  using  Ferroelectric  Liquid  Crystal  (FLC)  are 
considered  here.  Although  FLC  and  silicon  VLSI  based  smart  pixel  [2]  arrays  allow  an  interesting  versa¬ 
tility  at  the  pixel  level,  the  processor  considered  here  is  based  on  equivalent  resolution  but  higher  optical 
quality  and  lower  cost  devices  which  use  a  hydrogenated  amorphous  silicon  (aSi:H)  continuous  thin  film 
as  substrate. 

The  device  is  a  mosaic  of  nine  independently  driven  aSi:H/FLC  bistable  OASLMs  with  fixed  high 
threshold  [3]  (  250  /iW/mm3  at  633  nm  and  2.5  kHz  frame  rate)  realized  on  the  same  optical  fiats.  It  is 
illuminated  on  both  sides  through  the  polarizing  beam  splitters  CBS  and  FBS  (see  figure).  The  aSi:H/FLC 

'Pierre  Cambon  ia  with  Opt.  Comp.  Sjre.  Center,  Uni*,  of  Colorado,  phone  (303)  493  5005  until  May  31, 1994.  phone: 
(303)  493  5006,  E-mail:  >«»ilimaila/«lmA<^ln 
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interface  is  reflective  (pixels ted  aluminium).  The  central  OASLM  faces  in  the  opposite  direction  compared 
to  the  peripheral  ones.  The  right  side  performs  the  feedback  by  using  a  lens  (FL),  a  plane  facet  mirror 
(FM)  and  independently  controlled  feedback  laser  diodes  (Fid).  The  left  side  performs  correlation  from 
the  central  OASLM  to  the  peripheral  one’s  by  using  a  power  source  (PS)  and  variable  light  biasing  (Tld) 
for  threshold  control  and  erasing.  The  Alter  is  a  multiplexed  reflective  hologram  recorded  in-situ  using 
recent  Du  Pont  photopolymer  materials[4]. 

The  gray  level  input  image  (In)  is  formed  by  input  lens  (IL)  on  the  central  OASLM.  A  biasing  light 
(from  Tld  through  the  mirror  FM)  is  incoherently  added  to  the  image  to  determine  the  threshold  level  and 
the  result  thresholded  and  stored  as  a  binary  image  in  the  FLC.  Using  the  power  source  (PS)  this  slice  is 
convolved  simultaneously  with  the  eight  kernels  recorded  in  hologram  (RMH)  chosen  for  the  application. 
One  or  a  few  of  the  convolved  images  are  stored  on  a  peripheral  OASLM  while  the  previously  stored 
images  on  the  other  OASLMs  are  retained.  The  other  slices  of  the  images  are  successively  formed  and 
processed  in  the  same  way  and  stored  in  peripheral  OASLMs.  The  final  summing  operation  is  performed 
by  illuminating  simultaneously  with  the  feed-back  diodes  the  OASLMs  where  the  previously  processed 
slices  are  stored.  The  input  image  is  obstructed  using  the  shutter  SH.  The  gray  level  output  image  is 
directed  onto  the  “Out”  plane  by  activating  a  quarter  wavelength  FLC  plate  (QP)  which  rotates  the 
polarization  of  the  reflected  beam.  Binary  image  processing  can  be  performed  in  the  same  way  and 
operations  are  easily  cascaded  by  using  the  feedback  diodes. 

Design,  scaling,  power  budget  and  technology  are  presented.  Performance  simulations,  partial  experi¬ 
ments  and  controlling  programs  exemplified  by  a  cytology  diagnosis  application  requiring  only  four  slices 
of  256  X  256  images  are  shown. 
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Abstract 

Details  of  the  implementation  of  two  optical  computer  demonstrations  will  be  presented: 
a  64-channel  S-SEED  cellular  logic  image  processor  with  a  dynamic  interconnect  and  a  sorting 
module  implemented  by  interconnecting  smart  pixels  with  a  shuffle. 

Summary 

Dynamic  interconnect  0-CUP:  This  is  a  demonstration  of  a  8x8  channel,  S-SEED  optical 
cellular  logic  image  processor  using  hybrid  lenses  and  a  dynamic  interconnect.  Two  arrays  of 
S-SEEDs  are  used  and  are  interconnected  to  each  other  as  shown  schematically  in  the 
photograph  (figure  1).  The  interconnect  used  is  an  electrically-addressed  Seiko  Epson  phase 
modulator(not  shown  in  photo).  For  optimum  efficiency,  it  is  essential  that  the  CGH  period  is 
an  integer  multiple  of  the  SLM  pixel  pitch  (46  pm).  The  pixel  pitch  of  the  microlens/S-SEED 
was  therefore  chosen  to  be  160  pm  and  a  41.9  mm  focal  length  lens  was  used.  The  interconnect 
can  i)  map  one-to-one,  ii)  fan-out  by  3  horizontally  to  straight  on  and  two  nearest 
neigh  bouis(460  pm  period),  iii)  fan-out  to  next  nearest  neighbours  and  straight(230  pm).  With 
these  reconfigurable  interconnects  it  is  possible  to  create  a  compact  system  with  interesting 
processing  capabilities. 

One  point  of  the  demo  is  to  show  that  a  low -power  high-field  macrolens  in  combination 
with  a  high-power  low-field  microlens  produces  the  high-power  high-field  hybrid  lens  which 
is  required  to  interface  with  a  large  array  of  widely  separated  smart  pixels  with  small  windows. 
The  microlens  needs  to  have  field  of  30  pm  to  allow  both  dual-rail  beams  required  to  read  and 
set  the  state  of  a  S-SEED  to  pass  through  a  single  microlens.  A  singlet  microlens  is  inadequate 
so  an  afocal  relay  consisting  of  f/3  and  f/l  doublets  has  been  fabricated.  This  is  also  required 
for  the  magnification  step  between  the  15pm  spot  produced  by  the  low  power  lens  and  the 
sub-5pm  spot  required  for  the  S-SEED.  The  in-house  macrolens  used  is  a  f/5  triplet  with  a  10 
degree  full-field  (7.3mm).  Full  details  will  be  presented  of  the  characterization  of  system 
components  and  its  operation. 

Sorting  Module:  Another  demonstration  system  currently  under  construction  is  a  system 
which  performs  the  bitonic  sort  algorithm  using  smart  pixels  interconnected  by  a  2-D  perfect 
shuffle  (figure  2).  The  perfect  shuffle  module  is  based  on  an  approach  adapted  from  Goonan[l] 
which  uses  a  2x2  fan-out  grating  followed  by  a  2x  telescope.  The  smart  pixels  required  for  this 
system  consist  of  an  array  of  pixels  with  the  following  functionality:  a  self-routing 
exchange/bypass  node  with  a  latch,  optically  programmable  to  sort  the  higher  or  lower  of  the 
two  inputs  to  either  output  or  allow  a  straight  connection  with  no  comparison,  optical  clock,  and 
4-bit  shift  registers  at  both  outputs.  A  FET-SEED  smart  pixel  has  been  designed  to  perform  this 
function.  A  similar  but  less  powerful  F-SEED  has  been  fabricated  and  results  of  tests  on  it  will 
be  presented.  In  addition,  a  CMOS  version  of  this  smart  pixel  has  been  designed  and  is 
currently  being  fabricated.  A  SPICE  simulation  has  shown  that  it  should  operate  at  100MHz. 
Results  will  be  presented  of  a  hybrid  smart  pixel  array  which  will  be  fabricated  by  bonding  an 
InGaAs  modulator  array  to  a  CMOS  circuit 

[1]  T.  J.  Cloonan  et  al,  "A  complexity  analysis  of  smart  pixel  switching  nodes  for  photonic  EGS 
switching  networks,"  IEEE  J.  QE  29  619  (1993) 
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Figure  1:  Dynamic  interconnect  O-CLIP 
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